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FOREWORD 


The  Second  Army  Conference  on  Applied  Mathematics  and  Computing 
was  held  o»-22«25  May  190>in  the  Communications  Center  at 
Rensselaer  Polytechnic  Institute.  It  featured  several  current 
research  areas,  including  robotics,  continuum  mechanics,  and 
innovative  computational  methods.  Also  a  special  session  was  q 
held  on  constitutive  equations  for  high  strain  rate  problems 
The.  conference  was  initially  planned  for  three  days,  but  in  order 
to  accommadate  the  unusually  large  number  of  contributed  papers, 
namely  seventy- three,  an  additional  day  was  scheduled. 

The  Army  Mathematics  Steering  Committee  (AMSC)  is  the  sponsor  of 
this  conference,  and  it  had  as  its  hosts  the  Benet  Weapons 
Laboratory  and  Rennselaer  Polytechnic  Institute.  Drs.  San  Li  Pu 
and  John  Vasil akis  served  as  representatives  for  the  Benet 
Weapons  Laboratory,  while  Professors  Donald  A.  Drew  and  Joseph  E. 
Flaherty  were  the  local  chairpersons  for  the  other  host.  Members 
of  the  AMSC  take  this  opportunity  to  thank  these  gentlemen  for 
all  their  time  and  work  preparing  for  and  conducting  this  well 
run  scientific  meeting. 

The  Series  of  annual  meetings  entitled  Army  Conferences  on  f 
Applied  Mathematics  and  Computing  combines  two  former  symposia,  / 
namely  the  Conferences  of  Army  Mathematicians  and  the  NumericajK 
Analysis  and  Computers  Conferences.  Picking  topics  to  be  empha¬ 
sized  at  the  present  conference,  the  organizing  committee/ 
selected  areas  of  research  that  span  the  fields  of  the^darlier 
conferences.  This  point  is  well  brought  out  by  the^fol lowing 
list  of  invited  speakers  together  with  the  titles  of  their 
addresses.  / 


SPEAKERS  AND  AFFILIATIONS 


TITLE  OF  ADDRESSES 


Professor  Ferdinand  Freudenstein 
Columbia  University 

Professor  George  C.  Sih 
Lehigh  University 


Computer-Ai ded  Mechani sms 
Analysis  and  Design^ 

Scaling  of  Size  and  Time 
Associated  with  Damage 
Prediction > 


Professor  John  W.  Hutchinson 
Harvard  University 


Methods  for  Analyzing  the 
Mechanical  Properties 
of  Nonlinear  Two  Phase 
Composite  Material s^  _>  c^d 


Professor  H.  T.  Kang 
Carnegi e-Mellon  University 


Professor  John  Hopcroft 
Cornell  University 


Parallel  Computations, 
Computational  Complex¬ 
ity  and  Very  Large 
Scale  Integration  ^ 


Mathematical  Foundations 
for  Robotics j 


Professor  D.  P.  Bertsekas  Distributed  Asynchronous 

Massachusetts  Institute  of  Algorithms  .  j 

Technology 

Professor  A.  Jameson*  Computational  Methods  for 

Princeton  University  for  Transonic  Flows, 


Members  of  the  AMSC-  were  very  pleased  with  the  number  and  the 
quality  of  the  papers  presented  at  this  conference.  They  were 
also  pleased  to  have  so  many  of  said  papers  submitted  for  pub¬ 
lication  in  the  Transactions  of  this  meeting.  These  interesting 
and  Informative  articles  can  reach,  in  this  printed  form,  persons 
who  were  unable  to  attend  the  conference;  and  for  those  In 
attendance  this  technical  manual  offers  an  opportunity  to  study 
in  depth  their  contents. 


*Ws  are  sorry  that  Professor  Jameson  was  unable  to  present  his  paper 
at  this  meeting. 
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ABSTRACT.  The  modern  development  of  the  subject  of  mechanisms  has  been 
influenced  heavily  by  high-speed  computation.  This  is  due  primarily  to  the 
nonlinearity,  complexity  and  variety  of  the  mechanical  elements  involved. 
Recent  developments  and  research  in  this  area  will  be  reviewed  with  emphasis 
on  conceptual  design,  kinematic  analysis  and  synthesis  and  dynamic  analysis. 

I,  INTRODUCTION.  The  subject  of  mechanisms  is  in  some  respects  very  old, 
while  in  others  it  is  remarkably  new.  Mechanical  invention  is  already  in 
evidence  in  the  earliest  civilizations.  The  evolution  of  mechanical  design 
from  a  purely  intuitive  "mechanic  art"  into  an  engineering  discipline 
constitutes  one  of  the  significant  current  engineering  developments.  We  begin 
this  review  with  a  survey  of  the  creative  phase  of  mechanisms  design. 

II.  THE  CONCEPTUAL  DESIGN  OF  MECHANISMS,  The  creative  phase  of 
mechanisms  design  is  probably  the  most  elusive  and  challenging.  Attempts  to 
systematize  this  process  traditionally  involve  the  functional  classification 
of  mechanisms,  such  as  atlases  of  mechanisms.  Beginning  with  C.  Babbage  in 
1826  (2)  isolated  attempts  have  appeared  aimed  at  the  development  of  a 
systematic,  abstract  and  useful  representation  and  classification  of 
mechanisms  (e.g.  1,8,14).  Beginning  in  1964-65  graph  theory  was  Introduced 
for  the  representation  of  kinematic  structure  (3-6,9-13,15).  In  this  approach 
links  are  represented  by  vertices,  joints  by  edges  and  the  edge  connection  of 
vertices  corresponds  to  the  joint  connection  of  links,  edges  being  labeled 
according  to  joint  type  and  the  fixed  link  being  identified  as  well.  For 
example  the  familiar  swinging-block  mechanism  and  its  graph  are  shown  in 
Figs,  la  and  lb,  respectively.  In  the  latter  figure  the  number  of  each  vertex 
corresponds  to  the  link  identification  of  Fig.  la;  the  symbols  RtP  denote  pin 
joints  and  sliding  joints,  respectively;  and  the  small  circle  around  vertex  2 
represents  the  fixed  link  identification. 

Graphs  can  be  defined  analytically  and  hence  stored  in  the  memory  of  a 
computer.  The  enumeration  of  mechanisms  can  thus  be  reduced  to  a 
combinatorial  problem  in  graph  enumeration  with  different  mechanisms 
corresponding  to  non-isomorphic  graphs.  In  this  way  it  is  possible  to  create 
mechanisms  in  a  systematic  manner  with  only  minimal  assumptions  regarding 
their  nature  (such  as  the  degree  of  freedom  of  the  motion,  the  number  of 
moving  elements  and  the  admissible  joint  types).  In  this  approach  to  creative 
design,  therefore,  we  first  concentrate  on  kinematic  structure,  enumerate 
potentially  useful  mechanisms  and  then  evaluate  these  in  the  light  of 
functional  considerations. 


We  illustrste  the  procedure  in  a  specific  case:  the  development  of  c 
varianl e-stroke  engine  mechanism  (11).  A  variable-stroke  engine  would  have 
potentially  improved  fuel  economy  due  to  the  fact  that  lead  control  is 
achieved  by  varying  piston  stroke,  thus  eliminating  inlet -throttling  and 
reducing  pimping  losses  during  shore-stroke  operation.  Hence,  in  this  case  a 
mechanism  is  desired  for  converting  rotary  to  variable-stroke  reciprocating 
motion  while  maintaining  a  constant  or  nearly  constant  compression  ratio. 

In  this  problem  we  consider  single-degree-of-freedom  mechanisms  with  pin 
joints  and/or  sliding  joints  and  we  limit  the  search  to  plane  mechanisms.  The 
desired  mechanism  must  be  adjustable  so  as  to  provide  stroke  control  while 
maintaining  the  desired  compression  ratio,  favorable  force  transmission  and 
acceptable  dynamic  characteristics  under  all  operating  conditions.  From  the 
standard  degree-of -freedom  equation  for  mechanisms  it  follows  that  we  need  to 
search  for  mechanisms  with  (4+2n)  links  and  (4+3n)  joints  where  nHa 
positive  integer.  To  avoid  undue  complexity  we  restrict  the  search  to  n=l  or 
2.  Even  so  the  number  of  mechanisms  created  in  the  search  is  in  excess  of  one 
hundred.  Restrictions  as  to  the  number  of  admissible  sliding  pairs,  floating 
links  etc.  limit  the  search  to  about  40  mechanisms.  Fig.  2  shows  some  of  the 
graphs  associated  with  six-link  configurations  (n=l)  and  sketches  of  the 
corresponding  variable-stoke  mechanisms,  while  Fig.  3  shows  the  same  for  the 
eight-link  structures  (n=2).  Tne  mechanisms  were  then  evaluated  on  the  basis 
of  functional  requirements.  Known  configurations,  such  as  prior-art  patents, 
were  recreated,  thus  providing  a  valuable  check  both  on  the  search  as  well  as 
on  the  prior  art.  Three  mechanisms  were  found  which  were  potentially 
acceptable.  Of  these  the  most  favorable  was  judged  to  be  the  mechanism  shown 
in  Fig.  4,  which  was  then  investigated  in  greater  detail.  The  final  design  of 
this  configuration,  shown  in  Fig.  5,  was  awarded  a  U.S.  patent  (#4,270,495, 
1981). 

This  approach  has  been  used  with  success  in  recent  times  in  several 
applications  including  the  development  of  novel  casement  window  mechanisms  (7) 
and  wheel  dampers  (16).  Current  research  efforts  are  concerned  with  the 
partial  automation  of  this  process.  This  involves  automation  of  the 
determination  (enumeration)  of  the  kinematic  structures  or  graphs  satisfying 
the  search  specification  and  the  dimensioning  and  mechanical  analysis  of  the 
mechanism  created  in  the  search.  The  latter  would  include 
interactive-graphics  capabilities  and  potentially  also  artificial -intelligence 
techniques  (expert  systems)  which  would  permit  the  evaluation  of  individually 
tailored  search  specifications  and  functional  evaluation  criteria. 

This  approach  to  conceptual  design  also  has  important  implications  in  the 
evaluation  of  patents  and  in  recognizing  similarities  and  differences  in 
mechani sms . 


III.  KINEMATIC  AND  DYNAMIC  WALYSIS  OF  MECHANISMS.  The  computer-aided 
kinematic  and  dynamic  analysis  of  mechanisms  has  been  greatly  aided  by  the 
development  of  large-scale  d.esign  codes.  These  solve  the  kinematic 
loop-closure  equations  of  mechanisms  to  determine  displacements,  velocities 
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anti  accelerations.  The  dynamics  is  determined  from  the  Lagrangian  equations 
of  motion,  usually  in  variational  form.  In  this  way  complicated  mechanisms 
can  be  analyzed  economically  and  design  alternatives  evaluated  by  parameter 
variation.  Tnis  has  made  possible  the  economical  analysis  of  both  mechanisms 
and  mechanical  systems  of  extraordinary  complexity  in  the  design  stage. 


Table  1  summarizes  the  basic  characteristics  of  some  major  design  codes. 


Table  1;  Codes  for  the  kinematic  and  dynamic  analysis  and  synthesis  of 
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CODE 

DEVELOPER 

BASIC  CAPABILITY 

SOME  TYPICAL 

■muimvm 

ADAMS 

:-i.A.  Chace  and 
associates  (U.  of 
Michigan;  Me¬ 
chanical  Dynamics 
Inc.) 

Kinematic  and  dyna¬ 
mic  analysis  of 
three-dimensional 
mechanisms  and  me¬ 
chanical  systems. 

Vehicle  dynamics, 
agricultural  equipment, 
general  machinery, 
mechanisms,  robotics. 

T57S355 

E.J.  Haug  and 
associates 
(U.  of  Iowa) 

Kinematic  and  dyna¬ 
mic  analysis  of 
plane  and  three- 
dimensional  me¬ 
chanisms  and  me¬ 
chanical  systems. 

Complex  vehicles, mecha¬ 
nisms,  agricultural 
machinery,  intermittent 
mechanisms,  large-scale 
mobile  equipment. 

MW 

M.A.  Chace  and  Kinematic  and  dyna- 

Associates  (U.  of  mic  analysis  of 

Michigan;  Meehan-  plane  mechanisms 

ical  Dynamics  Inc.)  and  mechanical 

systems. 

Accident  reconstruction, 
general  machinery, 
nuclear  core  simulation, 
aircraft  mechanisms, 
vehicles. 

~W 

J.J.  ‘Jicker  and 
associates  (U.  of 
Winconsin) 

Kinematic  and  dynamic  Automotive  suspensions, 
analysis  of  plane  and  sewing  machine  drives, 
three-dimensional  vehicle  dynamics, 

mechanisms  and  mecha-  general  machinery,  ro- 
nical  systems.  botics. 

kYNSYM 

R.E.  Kaufman 
(U.  of  Washington) 

Kinematic  synthesis 
of  plane  linkage  me¬ 
chanisms  (determi¬ 
nation  of  mechanism 
proportions  for  a 
particular  motion 
sequence  via  a 
microcomputer  setup) 

Sizing  of  plane  linkage 
mechanisms  so  as  to 
obtain  a  prescribed 
motion  sequence,  e.g. 
in  production  processes 
general  machinery  etc. 

• 

UNCAGES  A.G.  Erdman 

(U.  of  Minnesota). 

Plane  kinematic 
analysis  and  synthe¬ 
sis  of  linkage 
mechanisms. 

Variable-speed  drives, 
window  actuators, 
biomechanical  devices, 
linkage  mechanisms. 
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Mechanisms  which  have  been  analyzed  with  the  aid  of  computer-aided 
techniques  in  recent  years  al *•«.  .nclude  artificial  limbs,  complex  planetary 
gear  trains,  shaft  couplings  and  many  others. 

In  addition  to  the  computer-aided  design  of  mechanisms  and  mechanical 
systems,  a  substantial  area  of  research  and  development  is  concerned  with 
basic  mechanical  components.  As  we  shall  see  in  the  following  remarks,  this 
subject  is  still  filled  with  challenges. 

IV.  MECHANICAL  COMPONENTS.  Despite  the  sophistication  of  the  analysis 
capability  in  mechanisms  and  mechanical  systems,  there  are  important  areas  in 
which  our  understanding  of  mechanisms  remains  lacking.  For  these  the 
development  of  effective  computer-aided  design  methods  hinges  on  a  better 
understanding  of  their  fundamental  mechanics. 

Amongst  the  mechanisms  in  th'S  category  there  are  many  basic  machine 
components,  such  as  cam-follower  systems,  universal  joints,  constant- velocity 
shaft  couplings,  variable-speed  transmissions  and  others.  Many  of  these  were 
established  long  ago,  as  were  the  simpler  aspects  of  their  motions.  These 
components  are  skillfully  proportioned  on  the  basis  of  many  years  of 
experience,  as  well  as  life  and  wear  testing.  The  sizing  of  these  components 
in  the  design  stage,  however,  remains  a  challenging  objective.  As  technology 
continues  to  advance  at  a  rapid  pace,  mechanical  devices  are  expected  to 
operate  reliably  at  ever  increasing  speeds  and  loads  and  the  rational  and 
predictive  design  of  these  mechanisms  has  become  increasingly  necessary. 
Amongst  other  things  this  includes  the  determination  of  the  internal  force 
and  torque  reactions  of  these  components  under  both  static  and  dynamic  loading 
and  the  ability  to  predict  limiting  speeds  before  the  onset  of  unacceptable 
wear  rates  and/or  destructive  impacts  and  overloads. 

In  belt  drives,  for  example,  the  exponential  belt-tension  equation,  which 
was  derived  by  Euler,  is  still  representative  of  the  state  of  the  art  in  many 
aspects  of  this  field.  In  universal  joints,  the  internal  force  and  torque 
reactions  are  not  as  yet  properly  understood.  In  many  constant-velocity  shaft 
couplings  there  are  a  substantial  number  of  ball  or  roller  elements  with 
complex  motions  involving  line  or  point  contact.  Their  analysis  is  only  just 
beginning.  The  development  of  the  fundamental  mechanics  of  these  components 
presents  exciting  challenges  in  rigid-body  mechanics,  elasticity  theory  and 
stress  analysis.  Eventually  such  efto.ts  can  be  expected  to  lead  to  more 
efficient  and  powerful  computer-aided  design  procedures,  which  in  turn  will 
accelerate  utilization  of  these  components  under  more  severe  operating 
conditions. 

Another  area  which  is  still  largely  unexplored  is  the  logical  design  of 
mechanisms.  Many  mechanisms  perform  sohisticated  logical  functions  e.g.  in 
production  machinery  involving  decisions  concerning  sequences  and  timing  in 
cutting,  loading,  transfer  operations  etc.  This  subject  has  received  some 
attention  in  recent  years  (12),  but  its  potential  remains  to  be  recognized  by 
the  engineering  profession. 


And  finally  let  us  consider  the  area  of  three-dimensional  mechanisms. 
Although  much  progress  has  been  made,  especially  with  computational  approaches 
reducing  the  complexity  and  tediousness  of  their  design  analysis,  much  remains 
to  be  done.  For  example,  the  skew  four-bar  linkage  (Fig.  6)  is  a  basic 
spatial  linkage  used  to  connect  non-parallel  non-intersecting  shafts.  It  is 
known  from  experience  that  depending  on  proportions  the  input  and  output  links 
function  either  as  cranes  (i.e.  can  make  complete  rotations)  or  rockers  (i.e. 
can  only  oscillate).  An  analysis  of  the  dimensional  restrictions  which  govern 
the  rotatability  of  the  links  is  both  basic  and  extremely  difficult. 

In  the  area  of  robotics  the  analysis  of  workspace  represents  another  very 
active  area.  Workspace  is  a  complicated  function  of  three-dimensional  linkage 
geometry.  The  determination  of  robotic  workspace  for  given  linkage 
proportions  can  be  handled  by  the  previously  mentioned  three-dimensional 
codes.  But  the  synthesis  of  robot  dimensions  for  optimum  workspace 
characteristics  is  as  yet  in  the  research  stage.  As  the  fundamental  mechanics 
of  these  areas  continues  to  develop  we  can  expect  corresponding  advances  in 
their  computer-aided  design. 

V.  CONCLUSION.  This  has  been  a  brief  survey  of  trends  in  the 
computer-aided  design  of  mechanisms.  Many  of  the  subjects  which  have  been 
discussed  (e.g.  the  conceptual  design  of  mechanisms)  involve  areas  of  research 
in  which  the  author  and  his  students  have  been  involved.  A  significant  part 
of  this  research  has  been  sponsored  by  the  Army  Research  Office.  The  author 
would  like  to  take  this  opportunity  to  express  his  appreciation  for  this 
support. 
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FIG. 2 


STRUCTURAL  AND  SCHEMATIC  DRAWINGS  OF  SOME  SIX-LINK 
VARIABLE-STROKE  MECHANISMS 

Reprinted  with  the  permission  of  the  American  Society  of 
Mechanical  Engineers  from  “Development  of  an  optimum  variable- 
stroke  internal -combustion  engine  mechanism  from  the  viewpoint 
of  kinematic  structure"  by  F.  Freudenstein  and  E.R.  Maki. 
Trans.  ASME;  J.  of  Mechanisms,  Transmissions  and  Automation 
in  Design  105,  1983,  pp.  259-266. 


FIG.  4. 

OPTIMUM  VARIABLE-STROKE  MECHANISM 


Reprinted  with  the  permission  of  the  American  Society  of 
Mechanical  Engineers  from  "Development  of  an  optimum 
variable-stroke  internal -combustion  engine  mechanism  from 
the  viewpoint  of  kinematic  structure"  by  F.  Freudenstein 
and  E.R.  Maki,  Trans. ASME,  J.  of  Mechanisms*  Transmissions 
and  Automation  in  Design  105,  1983,  pp.  259-266. 
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FINAL  DESIGN  OF  OPTIMUM  VARIABLE- STROKE  MECHANISM  OF 
FIG. 4  (U.S.  PAT.  #4,270,495;  1981) 
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VALIDATION  OF  A  MULTILAYERED  VISCOELASTIC  SEISHIC  PROPAGATION 
MODEL  FOR  SHORT  TO  MEDIUM  RANGES 


Sen  L.  Carnes,  P.E. 

Environmental  Systems  Division 
Environmental  Laboratory 
U.S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  MS  39180 

Abstract.  Field  tests  were  conducted  to  provide  a  complete  data  base  for 
validation  of  the  WES  Seismic  Propagation  Model.  The  model  waa  developed  in 
1973  to  predict  seismic  signals  resulting  from  a  variety  of  seismic,  sources, 
but  had  not  been  validated  for  short  to  medium  ranges  (up  to  19  km) .  Model 
predictions  compared  favorably  with  tests  results  and  after  a  refinement  of 
site  physical  properties  and  internal  damping  factor  (or  g),  calculations  were 
even  more  compatible.  The  model  will  now  be  available  for  making  predictions 
at  other  sites  with  a  greater  degree  of  reliability.  The  validation  procedure 
included  preliminary  prediction,  testing  and  verification  of  data,  and  model 
refinement  and  validation. 

Introduction.  The  Rayleigh  wave  components  of  seismic  waves  are  generally 
considered  as  the  primary  energy  conveyance  that  activates  seismic  sensors. 

The  amount  of  energy  generated  by  a  target  of  military  interest,  such  as  a 
vehicle,  military  equipment,  or  a  specific  military  operation  or  activity,  is 
site  dependent,  and  the  effect  of  terrain  on  Rayleigh  wave  propagation  is  known 
to  be  substantial.  A  mathematical  computer  model  was  developed  at  the 
U.  S.  Army  Engineer  Waterways  Experiment  Station  (WES)  to  predict  microseismic 
signals  in  various  terrain  materials  for  application  to  sensor  systems  (Lundien 
and  Nikodem  1973) .  The  model  uses  site  information  including  compression  and 
shear  wave  velocities  and  material  density  and  thickness  for  multiple  layers, 
along  with  seismic  source  data,  to  simulate  the  generation,  coupling,  propaga¬ 
tion,  and  decoupling  of  seismic  signals  (surface  waves)  from  the  source  to  the 
sensor.  This  concept  is  illustrated  in  Figure  1,  which  shows  the  seismic  wave 
propagating  from  the  source  to  the  sensor. 

Model  Description.  The  model  develops  mathematics  within  a  framework  of 
theoretical  mechanics  to  simulate  the  Rayleigh  component  of  a  seismic  wave  in 
a  layered  medium.  Rayleigh  waves,  or  elastic  surface  waves,  are  formed  when  a 
bounding  surface  exists  on  an  elastic  half  space.  The  particle  motion  of 
Rayleigh  waves  is  elliptically  retrograde  in  a  plane  perpendicular  to  the 
bounding  surface  and  parallel  to  the  direction  of  pr  pagation.  The  Rayleigh 
wave  propagation  velocity  is  smaller  than  that  of  ci  jpression  or  shear  waves 
and  in  most  soils  is  generally  about  90  percent  of  the  shear  wave  velocity, 
which  in  turn  is  about  40  to  50  percent  of  the  compression  wave  velocity.  A 
Rayleigh  wave’s  energy  per  unit  area  (usually  expressed  as  a  form  of  particle 
motion)  decreases  with  increasing  range  (as  it  spreads  over  an  increasing 
area).  This  is  termed  geometric  attenuation,  which  causes  a  decrease  in  parti¬ 
cle  motion  amplitude  proportional  to  1/\^F  where  r  is  the  range  from  the 
seismic  source.  Of  the  total  seismic  energy  input  by  the-  source,  approximately 
two-thirds  is  transmitted  away  by  a  Rayleigh  wave. 


In  addition  to  geometric  attenuation,  there  is  also  a  loss  of  energy  from 
internal  friction  in  which  some  of  the  mechanical  energy  is  converted  into 
heat.  This  internal  damping  increases  with  frequency.  Rayleigh  waves  with 
short  wavelengths  (high  frequencies)  are  attenuated  more  rapidly  with  depth 
than  those  with  long  wavelengths  (low  frequencies).  When  a  Rayleigh  wave 
travels  in  a  horizontally  layered  medium  in  which  its  velocity  is  constant 
within  a  layer  but  different  from  layer  to  layer,  the  wave  will  be  dispersed 
as  it  propagates.  Because  of  dispersion,  the  Rayleigh  wave  can  have  a  low 
velocity  (and  high  frequency)  in  the  surface  layer  and  a  high  velocity  (and 
low  frequency)  in  the  basement  layer,  depending  on  layer  properties. 

Another  effect  of  a  horizontally  layered  medium  is  that  elastic  waves 
originating  at  the  surface  may  be  reflected  or  refracted  by  the  layers.  These 
changes  cause  energy  losses  that  also  affect  attenuation.  The  waves  can  also 
be  polarized  or  even  converted  to  other  wave  forms.  The  effects  of  macrogeo¬ 
metric  features  can  be  simulated  by  frequency-dependent  amplxtude  transmission 
coefficients. 

Closed-form  equations  for  computing  the  seismic  signal  are  derived  from 
classical  equations  of  motion  to  define  the  stress-strain  relationship  in  the 
propagating  medium.  The  development  of  an  economical  and  practical  mathemati¬ 
cal  simulation  of  the  phenomena  shown  in  Figure  1  requires  several  assumptions 
for  simplification.  The  assumptions  describing  the  geometry  and  composition 
of  the  system  are:  (a)  the  medium  is  composed  of  homogeneous  and  isotropic 
horizontal  layers,  (b)  all  layer  boundaries  are  smooth  and  abrupt,  and 
(c)  major  discontinuities  at  the  surface  are  large  compared  with  the  signal 
wavelength.  The  assumptions  describing  wave  motion  are:  (a)  all  predictions 
are  made  for  a  point  source,  which  means  that  source  contact  dimensions  are 
small  compared  to  the  signal  wavelength,  (b)  all  motions  are  at  a  level  low 
enough  to  remain  within  the  elastic  realm  of  the  material,  and  (c)  the  calcu¬ 
lations  are  made  using  axial  symmetry  (all  inputs  are  vertical  or  horizontal). 

The  equations  of  motion  used  herein  represent  the  propagation  of  a  distur¬ 
bance  that  involves  both  equivoluminal  and  irrotational  motion.  Separate  wave 
equations  can  then  be  obtained  for  each  type  of  motion.  These  equations  are 
translated  to  cylindrical  coordinates  so  that  axial  symmetry  can  be  applied. 

They  are  then  solved  by  applying  boundary  values.  The  end  result  is  the  gen¬ 
eral  expression  shown  in  Figure  2.  It  should  be  noted  that  the  more  general 
form  of  this  expression  makes  use  of  Fourier  transforms  to  widen  the  scope  of 
problems  that  can  be  evaluated. 

The  conceptual  steps  for  prediction  of  seismic  signals  are  shown  in  the 
block  diagram  in  Figure  3.  This  diagram  is  a  generalization  of  the  actual 
computational  sequence  used  in  the  comouter  program.  The  essential  steps  shown 
in  Figure  3  are  used  to  explain  the  computational  procedures  that  are  basically 
composed  of  two  parts:  (a)  computation  of  the  site  coefficients,  and  (b)  trans¬ 
formation  of  the  input  forcing  functions  to  a  predicted  signal. 

The  site  coefficients  that  must  be  determined  to  predict  a  signal  include 
(a)  wave  numbers,  (b)  source  coupling  coefficients,  (c)  Hankel  function  (i.e., 
the  transmission  coefficients),  and  (d)  surface  macrogeometry  coefficients. 


After  the  source  coupling,  transmission,  and  surface  macrogeometry  coeffic¬ 
ients  are  computed,  they  are  combined  for  each  wave  number  of  interest.  This 
is  accomplished  by  taking  products  of  the  three  coefficients  for  each  frequency 
and  mode  and  summing  the  model  results,  one  sum  for  each  frequency. 

The  site  coefficients  described  above  are  used  with  a  stress-time  func¬ 
tion  and  particle  motion  coefficients  to  arrive  at  a  prediction  of  particle 
velocity,  particle  displacement,  or  particle  acceleration  (see  Figure  2).  The 
stress-time  function  and  the  site  coefficients  are  combined  to  arrive  at  a 
prediction  of  a  signal  of  particle  motion  versus  time  at  some  range  r  .  To 
arrive  at  a  predicted  signal  in  •  time  domain,  the  inverse  Fourier  transform 
must  be  taken  of  the  product  01  combined  site  coefficients,  the  stress¬ 
time  Fourier  coefficients,  and  the  particle  motion  coefficients-.  This  time 
domain  signal  is  repetitive  as  a  result  of  the  periodic  input  source. 

Test  Program.  Since  its  development,  the  WES  seismic  model  has  been  used 
for  analysis  of  data  for  numerous  seismic  sensor  systems  that  deal  only  with 
short  ranges  (up  to  1  km) .  Present  military  systems  require  knowledge  of  activ¬ 
ities  producing  seismic  signals  out  to  10  km  and  beyond.  The  Army  Seismic 
Attenuation  Test  (SAT)  program  was  conducted  in  the  spring  of  1983  at  the 
U.  S.  Army  White  Sands  Missile  Range  to  provide  a  database  of  long-range  (up 
to  10-km)  data  for  model  refinement  and  to  document  other  phenomena.  The  WES 
seismic  model  was  used  to  make  predictions  for  the  tests,  using  preliminary 
seismic  data  for  the  alluvial  valley  test  site.  These  predictions  are  shown 
in  Figures  6-8  as  phase  velocity,  transmission  coefficients,  and  transfer  func¬ 
tions.  Tests  were  conducted  using  an  electrohydraulic  vibrator,  a  vacuum 
impulse  loader,  an  M-35  vehicle  out  to  1  km,  and  explosions  from  1  to  10  km. 

Data  were  recorded  at  a  stationary  array  of  geophones  and  on  a  portable  group 
of  geophones  emplaced  near  the  source.  Approximately  2000  channels  of  data 
were  recorded  during  this  extensive  test  program  (See  Figure  8  for  site  lay¬ 
out)  . 


Analysis.  The  analog  data  w  're  digitized  and  then  processed  using 
several  signal  processing  techniques,  such  as  band  pass  filtering,  Fourier 
transforms,  and  cross  correlation  to  analyze  attenuation,  transfer  function, 
source/array  spectral  densities,  time  of  arrival  (mode  analysis),  and 
time/frequency  analysis.  Figures  8-11  show  some  of  the  processed  data,  while 
Figures  12-14  show  attem  :tion  curves  for  vibration  and  explosive  tests  and 
calculated  transfer  functions.  Model  verification  was  begun  upon  completion 
of  the  data  analysis. 


To  compare  actual  data  with  the  model  predictions,  processed  data  were 
used  to  calculate  attenuation  ratios  for  various  tests,  as  can  be  seen  in  Fig¬ 
ures  15  and  16.  The  model  was  then  used  to  match  attenuation  ratios  for  the 
same  frequencies  by  varying  the  internal  damning  factor  (IDF) .  The  results 
can  be  seen  in  Figures  18  and  19,  which  show  IDF  versus  frequency.  A  rule-of- 
thumb  estimate  of  0.03  for  IDF  was  used  to  make  all  predictions  for  the  SAT 
program.  The  data  reveal  that  the  0.03  value  is  not  a  bad  estimate,  but  that 
the  calculations  could  be  refined  somewhat  by  having  an  IDF  of  0.02  or  a  value 
dependent  on  range. 
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The  effect  on  the  predicted  transfer  functions  as  a  result  of  changing 
che  IDF  can  be  seen  in  Figure  19.  The  short-range  (Q.1-,  0.5-,  and  1,0-km) 
site  transfer  functions  for  an  IDF  of  0.02  compare  vary  favorably  with  both 
the  actual  data  and  the  functions  for  an  IDF  of  0.03.  The  long-range  (2-,  4-, 
and  10-km)  site  transfer  functions  for  an  IDF  of  0.02  compare  more  favorably 
with  the  actual  data  than  the  long-range  functions  for  an  IDF  of  0.03. 

Conclusions.  This  analysis  is  not  complete.  Further  model  refinements 
now  being  attempted  include  varying  IDF  with  frequency  and  range  and  using 
overall  site  information  determined  by  the  field  tests.  Further  data  analysis 
also  being  accomplished  with  the  field  test  data  includes  averaging  of  data 
for  a  number  of  geophones  to  reduce  the  problem  of  background  noise  on  data 
that  are  totally  masked  by  wind  noise;  verifying  Rayleigh  wave  motion  using 
vertical,  radial,  and  transverse  geophone  data  to  plot  particle  motion;  and 
calculating  the  point  at  which  transfer  functions  are  affected  by  background 
noise. 

The  conclusions  reached  in  this  study  are:  seismic  waves  are  dispersed 
such  that  very  low-frequency  (2-  to  3-Hz)  signals  which  travel  in  deep,  high- 
velocity  layers  are  the  only  compccent  remaining  beyond  4  km;  the  resolution 
of  seismic  signals  measured  out  to  10  km  is  below  0.5  Hz;  and  wind  noise  is  a 
problem  in  making  seismic  measurements  over  long  ranges  for  wind  speeds  abov-ce 
8  m/sec. 

This  study  has  been  conducted  as  a  "generic”  study  in  nrd*r  that  the 
results  can  be  applied  to  many  programs,  especially  those  for  which  the  WES 
seismic  model  can  be  used  to  extrapolate  and  predict  information  for  short- 
and  medium-range  seismic  signals.  In  addition,  the  field  test  program  has 
enabled  the  use  of  the  WES  seismic  model  for  prediction  of  medium-range  seis¬ 
mic  information,  and  upon  completion  of  the  current  analysis  will  provide  not 
only  model  verification  hut  also  insight  into  capabilities  of  military 
equipment  for  seismic  detection. 

This  study  has  been  documented  in  a  WES  Technical  Report  entitled  "An 
Analysis  of  Short-  to  Medium-Range  Seismic  Attenuation  Tests  Using  a  Multi¬ 
layered  Viscoelastic  Seismic  Propagation  Model"  (Carnes  and  Lundlen  1984) . 


References 

Carnes,  B.  L.,  and  Lundien,  J.  R.  1984.  "An  Analysis  of  Short-  to  Medium- 
Range  Seismic  Attenuation  Tests  Using  a  Multilayered  Viscoelastic  Seismic 
Propagation  Model"  in  preparation,  U.  S.  Army  Engineer  Waterways  Experiment 
Station,  CE,  Vicksburg,  Miss. 

Lundien,  J.  R.,  and  Nikodem,  H.  1973.  "A  Mathematical  Model  for  Predicting 
Microseismic  Signals  in  Terrain  Materials,"  Technical  Paper  M-73-4,  U.  S.  Army 
Engineer  Waterways  Experiment  Station,  CE,  Vicksburg,  Miss. 


Figure  5-  Transmission  coefficient  H  vs.  frequency  predicted  by  model 


Figure  6.  Predicted  transfer  functions  for  SAT  site. 
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Figure  7.  SAT  site  layout. 


Figure  8.  Test  133,  Explosive  Source  at  Iko 
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Figure  15.  Attenuation  ratio  M  vs.  frequency  for  1-km 
discrete  frequency  vibration  tests.  Numbers  indicate 
locations  of  compared  data. 


Figure  19.  Transfer  functions  with  internal  damping  factor  »  0.02 
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ABSTRACT  The  analysis  of  manufacturing  processes  for  metal  sheets,  stresses 

in  elastomer  bumpers  and  other  geometrically  nonlinear  problems  require 
numerical  techniques  which  can  deal  with  continuously  changing  regions  of 
contact.  The  most  common  method  of  dealing  with  the  contact  problem  is  to 
introduce  gap  elements  in  the  contact  region.  These  elements  provide  either 

zero  or  very  large  stiffness  between  two  degrees  of  freedom.  The  methods  for 
determining  gap  element  stiffness  include  computations  which  determine  it 

contact  has  been  made.  In  this  effort,  a  revised  simplex  method  for  quadratic 
programming  is  investigated  for  treating  contact  problems.  The  method  allows 
the  contact  problem  to  be  treated  direc'ly  as  a  minimization  problem  subject 
to  an  inequality  constraint  without  the  need  to  define  gap  elements.  Tb  test 

the  effectiveness  of  the  algorithm  the  problem  of  the  buckled  elastica  in 
contact  with  itself  was  solved.  An  existing  finite  element  formulation  for 
the  extensional  elastica  based  on  the  minunim  potential  energy  principle  is 
used  to  model  the  buckled  elastica.  Solutions  to  the  contact  problem  are  also 
obtained  by  a  penalty  method  using  gap  elements.  Results  from  both  numerical 
methods  are  compared  to  theoretical  results. 

INTRODUCTION  The  problem  of.  a  beam  deforming  near  a  rigid  barrier  was 
recently  solved  by  Westbrook  .  In  his  analysis,  the  potential  energy  for  a 

beam  in  its  quadratic  finite  element  form  and  a  linear  constraint  equation 
(the  rigid  boundary)  are  joined  to  form  a  quadratic  programming  problen^.which 

is  solved  by  Rusin’s  revised  simplex  method  for  quadratic  programming  .  A 
total  Lagrangian  finite  element  formulation  for  extensional  elastica 

(large  deformations)  was  recently  developeo  by  Fried  .  In  this  effort,  the 
potential  energy  for  the  total  Lagrangian  finite  element  formulation  is 
expanded  so  that  the  contact  problem  can  be  approximated  by  a  quadratic 
programming  problem.  Successive  solutions  to  this  quadratic  programming 
problem  converge  to  the  contact  solution.  The  problem  of  a  buckled  elastica 
in  contact  with  itself  was  solved  theoretically  by  Flaherty  and  Keller 
This  problem  presents  an  interesting  test  for  nunerical  algorithms  intended 
for  contact  problems  in  the  presence  of  large  deformations.  Contact  occurs  at 
a  point,  not  a  region,  and  the  location  of  the  contact  point  moves  in  a 
nonuniform  manner  as  the  load  increases.  Nunerical  solutions  are  obtained 
using  both  the  simplex  method  and  the  more  commonly  used  penalty  method.  Both 

methods  compare  well  with  the  asymptotic  solutions  presented  by  Flaherty  and 
Keller  (Ref.  4) . 

The  finite  element  formulation  for  the  elastica  is  presented  and  is  used 
to  compute  known  values  of  the  initial  buckling  load  and  the  initial,  contact 
load.  The  effect  of  axial  stiffness  on  the  buckling  and  contact  loads  is 
investigated  and  an  axial  stiffness  is  selected  which  allows  the  inextensional 
elastica  to  be  approximated.  The  formulation  for  the  analysis  of 
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=  work  done  by  external  forces  on  an  element 

u>;  =  integration  weight. 

« 

The  above  form  of  the  potential  energy  can  be  differentiated  with  respect 
to  the  nodal  variables  to  obtain  an  element  gradient  of  the  potential  energy. 
That  is. 


a  -  dTTe  (4) 

“e  6  u.e 

Similarly,  the  element  tangent  matrix  can  be  obtained  by  differentiating  the 
element  gradient  vector.  That  is, 

Z-Tp 

k  -  ^  ^  («;) 

R« '  du,  1  a  u.| 

These  element  gradient  and  tangent  matrices  can  be  assembled  to  form 
global  gradient  and  tangent  matrices.  The  global  potential  energy  is  minimum 

when  the  gradient  is  null.  The  locations  of  the  minimuis  are  found  by  using 
the  Ne wton-Ra phson  method.  That  is,  given  a  displacement  vector  u{  we  find  a 

displacement  whose  gradient  is  closer  to  zero  by  computing 

ai'-  u<  -  K','s,  w 

The  above  method  was  used  to  obtain  solutions  for  the  initial  buckling, 
large  deformation  and  initial  contact  of  the  elastica.  The  deformed  shape  of 
the  elastica  for  end  loads  of  40,  50  and  7,j>  are  shown  in  Figure  2.  Note,  the 
initial  theoretical  buckling  load  is  471  and  the  initial  contact  load  is 
72.1R7.  to  demonstrate  that  this  extensible  elastica  formulation  can  be  used 
to  accurately  represent  the  inextensible  elastica  the  effect  of  the  magnitude 
of  c  orvthe  buckling  and  contact  load  was  determined,  see  Figure  3.  With 
c  *  in5  the  finite  element  values  for  the  buckling  and  contact  loads  have 
converged  to  three  significant  digits  to  the  theoretical  values  using  only  a 
coarse  mesh  of  14  elements. 

NONLINEAR  CONTACT  USING  QUADRATIC  PROGRAMMING:  puadratic  .  aiming  has  been 
used  to  solve  contact  problems  in  linear  elasticity  vhere  potential  energy 
(P.E.)  expression  and  ccntraint  equation  (C.E.)  can  be  expressed  as  follows. 


TTW  =  }xTi«-PT* 

P.E. 

(7) 

X  >  0 

C.E. 

(fi) 

The  paper  by  Rusin  describes  a  particular  form  of  quadratic  programming  vhich 
can  be  adapted  to  finite  element  contact  problems.  In  the  linear  problem,  K 
is  a  positive  definite  symmetric  matrix  and  p  is  the  load  vector.  The 

algorithm  finds  the  miuimun  ofTT(x)  such  that  equation  (P)  is  also  satisfied. 


The  geometrically  nonlinear  contact  problem  is  not  in  the  form  given  by 
equations  (7)  and  (P) .  However,  if  we  expand  the  nonquadratic  form  of  the 

potential  energy,  in  a  Taylor  expansion  near  the  contact  solution  we  can  solve 
the  contact  problem  by  solving  a  sequence  of  linear  problems.  Figure  4  shows 
a  Newton-Raphson  solution  to  the  nonlinear  buckled  elastica  problem  obtained 
without  consideration  of  contact  and  the  near-by  contact  solution.  Note,  in 

this  Lagrangian  formulation  x  is  a  configuration  vector,  not  a  displacement 
vector.  With  xQ  =  the  vector  obtained  by  minimizing  the  potential  energy  and 
x  =  the  vector  near  x  which  is  the  minimum  of  the  potential  energy  subject  to 
the  constraint  equation  the  potential  energy  is  expanded  as  follows. 

TTM  =  *n(vu0  +  (9) 

where  AX  =  X  -  X0 

with  ~  O  (Newton-Raphson  solution)  equation  (9)  can  be  arranged 

as 

TTCx)  =  4  xTK0X.  -  ul  K„) X  +  [TT ( XJ  +  -  xj  K0X J  +  •  -  •  <m> 


The  term  in  the  brackets  in  (It?)  is  constant  and  the  first  two  terms  are  in  a 
form  which  can  be  used  with  the  quadratic  programming  algorithms. 

To  continue  the  discussion  the  contact  problem  is  considered  for  the 
buckled  elastica.  The  vector  of  nodal  unknowns  is  given  by  xT  and  the  vector 
involved  in  the  contact  constraint  is  uT  where 


XT  =  (x,  i.Y.Y,  xzxt  •••  YHYN) 


(11) 


and  U  •••  XNvfN')  (12) 

Using  symmetry  to  solve  the  buckled  elastica  problem  as  indicated  in  Figure  4 
the  constraint  equation  is 


U  >  O 


(13) 


Then,  given  x  an  approximate  contact  solution  can  be  found  by  solving  the 
following  quadratic  programming  problem. 


Find 

rr\in  <4>CX) 

with 

U  >  O 

mmf  ^  x\x  -  pjx] 
vx  1  J 


(14) 
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where 


and 

To  obtain  the  optimal  contact  soution  the  quadratic  programming  problem  is 
repeated  by  recomputing  Ko  ana  Bo  at  each  solution  of  (14)  until  the  region  of 
contact  no  longer  cnanges.  One  last  Newton-Raphson  solution  obtained  while 
enforcing  the  appropriate  region  of  contact  yields  the  solution  to  the 
nonlinear  contact  problem. 

The  method  described  above  was  used  to  obtain  post  contact  solutions  to 
the  buckled  elastica.  The  location  of  the  contact  point  with  respect  to  the 
loaded  aid  of  the  elastica  was  computed  as  a  function  of  the  loading  for  a 
uniform  mesh  of  A P  elements  and  is  shown  in  Figure  5.  At  some  loads,  two 
contact  points  ware  indicated.  In  all  such  cases,  the  actual  contact  point 
was  located  between  them.  That  is,  when  two  contact  points  were  computed  they 
were  never  on  .one  side  of  the  actual  point  oontact  solution  given  by  Flaherty 
and  Keller  .  Some  solutions  were  obtained  for  a  uniform  mesh  of  80 

elements.  The  resulting  contact  point  locations  were  approximately  twice  as 
close  to  the  analytical  solution. 

nonlinear  CONTACT  using  THE  PENALTY  METHODS:  The  use  of  gap  elements  to  treat 
contact  was  described  by  Chan  ana  Tuba^  ' .  This  method  is  similar  to  the 
penalty  method'  .  The  penalty  method  is  easily  adapted  to  the  Lagrangian 

finite  element  formulation  being  used  here  and  provides  another  finite  element 
method  to  compare  with  the  analytical  solution.  The  contact  constraint  as 
stated  in  equation  (13)  can  be  obtained  in  a  least  squares  sense  by  adding  the 
square  of  the  nodal  value  (i.e.,  xn  or  y  )  for  any  nodal  variable  that  fails 
to  satisfy  equation  (13).  On  an  Element  level  this  is  done  by  adding  the 
product  of  the  squared  terms  and  the  penalty  parameters  as  follows  (only  Xn’s 
are  included  here) . 


K, 


_  Sir 


b  X 


&  = 


-  TTe  + 


I  V" 


xn<o 


(15) 


where  £>np  =  Cp  K-nn  lb  -  penalty  parameters  or  gap  element  stiffness 

a  variable  for  convergence  studies 
^  t  */z  if  one  or  no  adjacent  nodes  nave  <  P 
**  L  I  if  both  adjacent  nodes  have  x  <  0 

K_*  diagonal  term  from  the  tangent  stiffness  matrix  associated  with  the 
xn  displacement. 


The  selection  of  the  penalty  parameter  chosen  here  reflects  the  ideas 
presented  by  Chan  and  Tuba  in  vxiich  gap  element  stiffnesses  are  selected 
based  on  thie  current  global  tangent  stiffness  matrix  and  the  portion  of  the 
element  for  which  the  constraint  is  violated  (thus,  knn  and  cn  as  described 
above) . 
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The  computation  of  the  element  gradient  and  tangent  stiffness  is  straight 
forward  and  proceeds  as  follows. 


The  element  gradient  is  computed  as 

i  ^ire 

“  du.g  * 

9e  “  9e  +  9e  p 

and  the  tangent  matrix  is  found  from 


lx  Vi] 


K«  1  fu{  =  Ke  +  au|[£XpX“] 


K.  -  K„  +  K  >17’ 
e  €  ep 

The  assembly  of  the  global  gradient  and  tangent  matrices  and  the  introduction 
of  displacement  boundary  conditions  are  again  similar  to  the  methods  used  in 
linear  analysis.  Then,  the  Newton-Rapnson  method  is  applied  to  minimize  the 
global  form  of  equation  (15).  As  an  example  when  and  Xntare  two 
variables  which  violate  the  constraint  equation  (13)  the  gradient  is 
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and  the  tangent  stiffness  is 


.vV-VX'.'vN 


The  buckled  elastica  contact  problem  was  solved  for  an  end  load  of  8fl. (i 
and  the  range  of  penalty  parameters  indicated  in  Table  3.  The  location  of  the 
loaded  end  of  the  elastica  appears  to  be  converged  to  s  or  7  decimal  places 
wnen  Vn  =  CflKnn'0  •  Convergence  with  respect  to  mesh  size  was  also  considered 

and  tne  results  are  shown  in  Table  2.  After  the  penalty  parameter  was 
selected,  convergence  with  respect  to  mesh  size  was  numerically  determined. 
The  noshes  chosen  for  the  convergence  study  in  Table  2  all  have  an  identical 
contact  point  location  so  there  was  no  effect  from  a  shifting  contact  point 
due  to  a  finer  mesh.  A  uniform  mesh  with  8P  elements  was  required  to  obtain 
convergence  to  about  five  significant  digits  in  the  displacements.  Ibis  may 
be  due  to  the  fact  that  there  is  a  large  curvature  along  the  short  portion  of 
the  elastica  between  the  loaded  end  and  the  contact  point  wiich  is  difficult 
to  approximate  with  a  coarse  mesh. 

The  location  of  the  nodes  in  contact  was  determined  as  a  function  of  the 

load  for  a  uniform  mesh  of  4(*  elements.  The  results  are  shown  in  Figure  6. 
They  agree  well  with  Flaherty  and  Keller's  results  in  that  the  location  of 
contact  is  always  known  within  the  length  of  an  element.  A  more  sophisticated 
choice  of  the  penalty  parameter  (  Cn  in  particular)  would  improve  the  results. 

CONCLUSION:  The  use  of  quadratic  programming  and  penalty  methods  for  the 
analysis  of  geometrically  nonlinear  contact  problems  were  presented.  Of  the 
two  methods,  the  penalty  method  was  easier  to  adapt  to  the  total  Lagrangian 
finite  element  formulation  being  used.  The  solutions  obtained  using  the 
simplex  method  implied  point  contact  vhicn  is  in  agreement  with  theory.  The 
penalty  method  as  used  here  was  unable  to  predict  point  contact  as  clearly  as 
the  simplex  method.  For  a  given  mesh,  the  penalty  method  requires  that  a 
convergence  study  be  made  with  respect  to  the  penalty  parameter,  whereas,  the 
simplex  method  determines  the  results  in  one  analysis.  Both  methods  gave 
satisfactory  results  when  the  location  of  contact  was  determined  as  a  function 
of  the  load.  Additional  work  could  include  modifying  the  methods  in  such  a  way 
that  regional  and  point  contact  can  be  easily  determined  numerically. 
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ABSTRACT.  We  consider  the  problem  of  computing  the  stationary  flows  of  a 
viscoelastic  fluid  with  memory  flowing  through  a  given  domain.  The  proposed 
numerical  technique  is  basftd  on  optimal  control  technique?,  which  replace  the 
original  equations  of  the  problem  by  a  minimization  prob^^w  to  be  solved  by  a 
conjugate  gradient  algorithm.  Such  techniques  are  very  powerful  and  can 
handle  equations  which  change  type,  provided  that,  as  done  here,  one  use?  an 
adequate  preconditioning  operator  and  that  one  computes  efficiently  the 
gradient  of  the  function  to  be  minimized. 

I.  EQUATIONS  OF  THE  PROBLEM.  Let  ua  consider  a  viscoelastic  fluid  with 
memory  (of  upper-convected  Maxwell  type),  flowing  viscously  through  a  given 
domain  8  (Fig.  1).  We  suppose  that  no  slip  occurs  along  the  solid  walls  and 
that  the  fluid  velocity  at  the  entrance  and  at  the  exit  of  the  domain  is 
given. 


<x.y)=  (c.O)  ? 


Fig.  Hie  physical  problem 
(out  of  Malkus  [1984] ) 

4? 


tiltlii 


For  examples,  such  situations  arise  while  studying  plane  flows  over  plots, 
such  as  those  studied  experimentally  by  Bird  ar.d  al.  [1982].  The  equations 
governing  such  situations  are  simply: 


(1)  EQUILIBRI* . ' 

-div(cr)  +  p(u*V)u  =  f  in  £  , 

'V  •  /V,  «v  ^ 

(2)  CONSTITUTIVE  LAW  (upper-convected  Maxwell) 


£  =  SD  ~  / 


2n<X-t)  =  J  -U-expl^f^)[(F>  )_1  -  Ijdx  , 


.-1 


-00  X 


~t~t 


3X  (X,T) 

VS'T) - 51—  ' 

/V 

X^_(x ,x )  =  position  at  t  of  the  particle  which  is  in  £  at 

time  t  and  which  is  subjected  to  the  velocity  field  u. 

(3)  KINEMATIC  RESTRICTIONS 

div  u  =  0,  u  =  u  on  I*  . 

/v  O,  0 

Above,  u  represents  the  fluid  velocity,  a  the  Cauchy  stress  tensor,  p  a 
hydrostatic  pressure,  p  the  fluid  dei.sity,  U  the  fluid  viscosity  and  X 
the  relaxation  time.  Observe  that,  as  an  extra  boundary  condition,  the 
constitutive  law  (2)  requires  the  knowledge  of  what  happened  to  the  fluid 
before  it  enters  the  domain. 


It  has  been  observed  in  Joseph,  Renardy,  Saut  [1984],  that  these 
equations  change  type  when  tr.a  viscoelastic  Mach  number  j//p/Xp  reaches  1 
(U  is  a  characteristic  velocity  of  the  considered  flow).  Real 
characteristics  then  appear  along  which  the  vorticity  can  be  discontinuous. 
However,  most  numerical  methods  employed  for  solving  (1)-(3)  (such  as  the 
classical  fixed  point  method  solving  iteratively  for  the  velocities  and  then 
for  the  stresses)  cannot  handle  this  change  of  type. 

The  idea  of  this  paper  is  to^ employ  for  the  numerical  solution  of  ( 1 )-(3) 
optimal  control  techniques  in  H  (fl),  which  were  used  with  success  in 
transonic  flow  computations  (Glowinski  [1984]),  where  a  similar  change  of  type 
occurs.  To  be  applied,  such  techniques  require  .jthe  definition  of  an 
appropriate  isomorphism  between  HQ(fi)  and  H  [SI)  (the  preconditioning 
operator)  and  the  computation  of  the  transpose  of  the  linearized  constitutive 
equations .  Here,  for  Maxwell  viscoelasticity,  this  turns  out  to  be  very 
natural:  the  Stokes  operator  acts  as  a  very  good  preconditioning  operator  and 
the  transpose  of  the  constitutive  laws  leads  to  equations  strongly  related  to 
a  lower-convected  Maxwell  model  with  opposite  velocities. 
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2.  H" 1  LEAST-SQUARES  FORMULATION  OF  THE  PROBLEM. 

2.1  A  one-dimensional  model  problem.  Let  f  :  R  R  be  differentiable 
and  consider  the  problem  of  solving  numerical?;*  the  nonlinear  equation 

f ( x)  *  0  . 

If  it  has  a  solution  in  R,  then  this  equation  is  equivalent  to 

Minimize  -r  |f(x)|2  over  R,  (a  >  0)  , 

i  nii  ii«ii--  31  11 

problem  which  can  be  numerically  solved  by  the  gradient  a? gorithm 

+  x0  =  given  , 

+  for  n  =  0  until  satisfied  set 
xn+1  “  xn  ~  \  f<xn>f'(xn>  • 

This  algorithm  can  be  a  very  efficient  method  for  solving  f(::)  =  0,  provided 
that  a  is  properly  chosen  and  that  f(x)f'(x)  is  easy  to  compute.  It  will 
be  the  basis  of  the  numerical  technique  that  we  will  use  to  solve  (1)-(3). 

2.2  Maxwell  viscoelasticity.  Let  V  be  the  space 

V  =  {v  e  hJ(Q),  div  v  ■=  0}  , 

let  V*  be  its  topological  dual,  denote  by  <*,*>  the  duality  pairing 
between  V  and  V*  and  introduce  the  following  operators 

A  ;  V  +  V*,  A(v)  =  -div[y(Vv  +  VyT)]  ,  (Stokes) 

L  :  V  *  R,  L(v)  =  /  f»vdx  , 

8 

T  :  V  V*,  T(v)  =  p(u»V)u  -  div(0_(u))  -  A(v)  with 

4 

u  =  v  +  u„,  a  (u)  being  given  by  the  constitutive  rela  ion  (2), 

With  these  new  notations.  Equations  (II  to  (3)  take  the  form 

f A(u  -  u0)  +  T(u  -  uQ)  -  L  =  0  in  V*  , 

(4)  N 

(h-Uo  e  v  • 

If  (4)  has  a  solution,  then  (4)  is  obviously  equivalent  to  the  H  1  least- 
squares  formulation: 
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(5) 


MINIMIZE  J<v)  «j<Ay(v),  y(v)>  OVER  V  WHERE 
X<v)  6  V  IS  THE  SOLUTION  OF  THE  LINEAR  PROBLEM 
Ay(y)  =  Ay  r(v)  -  L  in  V*  . 


Indeed/  if  (4)  has  a  solution  v  =  u  -  u_,  and  if  we  take  v  =  v  in 
(5)/  then  the  right  hand  side  of  (5)  is  equal  to  zero,  thus  the  associated 
state  vector  y(v)  is  also  equal  to  zero,  arid  therefore  J(v)  is  equal  to 
zero.  Since  J(v)  is  positive  by  construction  (A  is  a  monotone  operator 
on  V),  this  implies  that  v  is  a  minimizer  of  J  over  V. 

Conversely,  let  w  be  a  minimizer  of  J  over  V.  As  seen  above,  since 

(4)  has  a  solution,  J  attains  the  value  0  on  V.  Thus,  J(w)  must  be 
equal  to  zero,  therefore  y(  w)  must  be  equal  to  zero  (the  Stokes  operator 
A  is  strictly  monotone  on  V).  By  construction  of  y(w),  this  implies  that 

the  right-hand  side  of  (5)  is  equal  to  zero  when  we  take  v  =  w,  which  means 

precisely  that  w  is  a  solution  of  (4). 

In  summary,  if  we  assume  the  existence  of  a  solution  to  our  original 
problem  (1)-(3),  we  can  replace  these  equations  by  the  equivalent  minimization 
problem  ju3t  written  above.  This  minimization  formulation  is  the  one  which 
will  be  used  in  our  numerical  techniques.  It  reduces  our  initial  problem  to 
an  optimal  control  problem,  if  we  identify  v  to  a  control  variable,  y  to  _a 
state  vector,  (5)  to  a  state  equation  and  J( • )  to  a  cost  function. 

3.  CONJUGATE  GRADIENT  METHOD. 


The  minimization  formulation  of  §2  is  interesting  because  it  can  be 
solved  numerically  by  a  conjugate  gradient  algorithm  which  has  superlinear 
convergence  properties.  This  algorithm  is: 
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L-.-2  .  . 

i  •  .  /•  t.  r  »  *  L'*  .  ■ 

K.Vt'-'ivy.v.- 


INITIALIZATION 


+  Take  u 

+  Solve 
+  Set  z® 


0  €  V  +  U. 

~0 

<A(3° )  ,w> 


=  0  until  satisfied. 


«  <J'(u°  -  u„: 
~  ~0 

LOOP 

do 


0 


1  4*1 


5  *  \  +  if  £efi» 


d  -  o  if  x  ^  n  , 


Z~t{2'x)  "  97-  <2'T)  * 


X~{x,T)  =  position  at  time  t  of  the  particle  which 

is  in  x  at  time  t  and  which  is  subjected 
to  the  velocity  field  u  (x)  *  -r(x). 


This  adjoint  state  can  be  computed  by  am  explicit  integration  along  the 
trajectories  of  JJ  =  X  +  jjq.  Then,  we  can  prove 


THEOREM;  The  term  <J' (v) ,w>  is  equal  to 


<J*(v),w>  =  /  p{(w*V)u  +  V ) w> v)dbc 
ft 


+  J  {p(Vw  +  VwT)  +  X(Vw)«  +  Xo_(Vw)T  -  X(w»V)o  }«H(v)dx 

J  r  A«W  /wj)  (V*w  #w  «w  #wj)  >W  4W  «v 


Proof:  By  definition  of  the  gradient,  we  have 


<j'(v),w>  =  lim  tJ(v  +  tw)  -  J(v)]  -  <A6y,y>  , 
t+0 


where  y  =  y(v)  is  the  solution  of  the  sta'.e  equation  (5)  and  where  6y  is 
obtained  from  w  by  differentiation  of  (5),  that  is  by  solving 


A5y  =  Aw  +  T'  { v  *  *w  in  V* 

ava  aw 


Substituting  this  definition  of  6y  in  the  expression  of  the  gradient.,  we  get 


<J'(v),w>  **  <Aw  +  T' (v)*w,y>  . 

Aa  Aa  Aa  Aa  Aa  ^a 


Using  the  definition  of  T( • ) ,  and  integrating  by  parts  the  term  in 
div(2^(jj)»jj),  this  gives 


<J*(v),w>  =  /  {p(w*;7hu  +  P(u»V)w}»ydx  +  /  to^(u)«w]«D(X)dx  , 
ft  ft 


with  D(y)  =  ~  (Vy  +  VyT) .  in  (9),  to  compute  the  action  of  the  derivative 
of  0  7u)  on2  w,  we  differentiate  the  constitutive  law  (2),  first  with 

<vn  tv  ^a  * 

respect  to  time,  then  with  respect  to  the  velocity  u.  We  obtain 


(10) 


"t  ■  satisfies  the  differential  equation 

-  x<2s>i  -  xi<vs>T  + 1 

-  2UD(W)  -  Atw'V^u)  +  MXx>2d<2>  +  x£d(u)(Vw)T  , 

x  *  0  on  (*  part  c:  r  with  jjq  •  jj  <  0)  . 

On  the  other  hand,  by  differentiating  the  adjoint  state  equation  (6)  with 
respect  to  time,  we  have 

(11)  /  I*E<X)*S  =  I  -  A(Vu)TH  -  XH(Vu)  +  H]dx  . 

a  a 

Integrating  (11)  by  parts,  and  taking  the  incompressibility  constraint 
div  u  =  0  into  account,  (11)  yields 

/  raw*3*  *  /  s*{X(s*Z)i  -  xi<Zs)  -  x<Xs)x  +  i>*s  » 

a  a 

which,  from  (10),  is  equivalent  to 

(12)  /  T*D(X)dx  =  /  H-{2UD(w)  -  A(w»V)o  +  A(Vw)o  +  Ao  (Vw)T}dx  . 

a  a 

Plugging  (12)  back  in  (9)  finally  gives  (8)  and  our  proof  is  complete . 

5,  FINAL  FLOW  CHART. 

With  the  gradient  given  by  (8),  the  final  flow  chart  corresponding  to 
conjugate  gradient  method  of  $3  is 


INITIALIZATION 


+  u  given; 

+  compute  £d(u°  -  uQ)  by  INTEGRATION  OF  (2)  ALONG  TRAJECTORIES; 

0 

+  compute  the  right-hand  side  r  of  the  state  equation  ( 5 ) ; 

+  compute  the  state  vector  y°  by  SOLVING  THE  LINEAR  STOKES  PROBLEM  (5); 

+  compute  the  adjoint  state  H°  by  INTEGRATION  OF  (6)  ALONG  TRAJECTORIES; 

+  compute  the  right-hand  side  £  of  the  gradient  equation  by  (8); 

+  compute  the  gradient  3°  by  SOLVING  A  LINEAR  STOKES  PROBLEM; 

A  _  0  _  0 
+  set  z  =  3  • 

LOOP  ON  n  >  0 

+  compute  p  =  Argmin  (J(un  -  uQ  -  pz”) )  BY  POLYNOMIAL  INTERPOLATION; 

+  set  un+1  =  un  -  p  zn; 

-  ~  ~ +1  n 

+  compute  0  (u  -  U  )  BY  INTEGRATION  OF  (2)  ALONG  TRAJECTORIES  OF 

n+1  ^ 

u  +  u. ; 

~  ~0 

+  compute  the  right-hand  side  r  of  the  state  equation  (5); 

+  compute  the  state  vector  xn+1  SOLVING  THE  LINEAR  STOKES  PROBLEM  (5); 

+  compute  the  adjoint  state  Hn+1  by  INTEGRATION  OF  (6)  ALONG  TRAJECTORIES; 
+  compute  the  right-hand  side  3  of  the  gradient  equation  by  (8); 

+  compute  the  gradient  3n+1  by  SOLVING  A  LINEAR  STOKES  PROBLEM; 


+  compute 


+  end  loop. 


n+1  _  n+1  .  n  ..n+1  n+1  n  .  n  n 

5  =  2  +  £  <A2  '2  “  2  >  f  <A2  '2  >  • 


Here,  the  minimization  of  J(u  -  uQ  -  pz  )  by  quadratic  polynomial 
interpolation  is  achieved  by 

+  computing  J(0)  =  <Ajrn ,£n>  =  rn»j£n; 

+  computing  J'(0)  =  -<A3°,zn>  =  -;jn,zn; 

+  computing  ^  =  u11  -  Pn_,zn? 

+  computing  ^  by  inverting  (5)  with  v  =  v^; 

+  computing  \  <  AXi'X<|>? 


+  setting  p  equal  to  the  rainimizer  of  the  parabola  which  agrees 
n 

with  J  twice  at  0  and  once  at  p  . . 
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NUMERICAL  IMPLEMENTATION 


The  practical  implementation  on  a  computer  of  the  above  flow  chart 
requires  the  solution  of  two  numerical  problems  * 

(i)  what  type  of  approximation  can  be  used  for  the  numerical  solution 
of  the  Stokes  problems? 

(ii)  what  numerical  integration  techniques  can  be  used  for  the 

integration  along  the  trajectories ,  while  respecting  the  mechanical 
objectivity  of  the  process? 

Those  problems  are  strongly  interconnected  since ,  for  example,  the  finite 
element  which  is  used  determines  the  aspect  of  the  computed  trajectories.  D. 
Malkus  f1984]  proposes  answers  which  are  very  attractive  because  they  respect 
the  physical -structure  of  the  problem.  His  technique  decomposes  as  follows: 

(i)  choice  of  an  exactly  incompressible  piecewise  linear  finite  element 


(such  as  the  linear  crossed  triangle)  for  approximating  the  velocity  field; 


(ii)  exact  computation  of  the  trajectories  incoming  at  the  center  of 


each  finite  element  through  a  piecewise  analytical  solution  of  the  ordinary 
differential  equation 

V*'T>  “  V2't}  *  £  ; 

(iii)  computation  of  the  deformation  gradient  history  by  solving 
analytically  the  equation 

f^.(Xt(x,T))  -  Vu[Xt(x,T)]|^(Xt(x,T)),  F^t)  «  l  ; 


*.  •,.*  « ■ .  * .  * 
-  #  %  *.#  •  *  -  • 


(iv)  computation  of  the  added  stresses  by  a  La guerre  type  numerical 

quadrature 


t  NT 

=  /  "^(T)  [£t(X,T)ldT  -  l  W(Ti)m1(Ti)  (Ft(X,Ti))  . 

i=1 

The  numerical  quadrature  of  (iv)  slightly  changes  the  constitutive  law 
but  respects  its  objectivity  since  the  trajectories  and  deformation  gradients 
are  exactly  computed. 

If  we  use  D.  Malkus'  ideas,  our  numerical  technique  for  the  solution  of 
( 1 )— ( 3 )  finally  reduces  to 

a)  the  transformation  of  the  original  equations  (1)-(3)  into  an 
equivalent  minimization  formulation  (§2), 

b)  the  solution  of  this  minimization  problem  by  the  conjugate  gradient 
algor it’  i  of  §5, 

c)  the  solution  of  each  Stokes  problem  involved  in  this  algorithm  by  a 
finite  element  method  using  linear  crossed  triangles  for  the 
approximation  of  the  velocity  field, 

d)  the  numerical  integration  of  the  integrals  along  trajectories  by  a 
La guerre  numerical  quadrature,  the  trajectories  being  computed 
analytically. 

If  needed,  an  upwinding  scheme  can  be  added  to  this  method,  simply  by 
replacing  in  the  state  equation  (5)  the  term  (u*£)u  by 

-r  [u(x)  -  u(X  Ix,t  -  e))]  . 

C  ^  ^  ^  t 

This  replacement  renders  our  method  fully  deterministic  and  corresponds,  in 
the  case  of  a  transonic  flow  computation,  to  the  usual  entropy  condition. 

7.  CONCLUSIONS 

As  described,  the  proposed  numerical  technique  can  be  expected  to  be 
cheap,  because  ic  only  involves  fixed  positiva  definite  sparse  matrices 
(namely  those  associated  to  the  linear  Stokes  problem),  and  to  be  able  to 
handle  hyperbolic  situations,  because  it  is  based  on  a  least-squares 
formulation.  Moreover,  it  respects  the  physical  objectivity  of  the  problem 
and  uses,  in  a  critical  way,  the  special  structure  of  the  adjoint  problem. 

Here,  we  have  restricted  ourselves  to  the  case  of  Maxwell 
viscoelasticity.  Of  course,  the  proposed  techniques  can  be  generalized  to  any 
viscoelastic  constitutive  law,  provided  that  we  can  easily  compute  the  term 
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ABSTRACT 


Mathematical  logic  is  a  particularly  fruitful  vehicle  for  expressing  programs  in  a 
declarative  style  and  seems  well  suited  for  artificial  intelligence  applications.  On 
the  other  band,  Horn  clause  logic  limited  to  conjuction  and  disjunction  is  some* 
what  primitive  and  low  level.  When  programs  are  written  as  arbitrary  n-ary  rela¬ 
tions,  as  in  Prolog,  hierarchically  constructing  more  complex  programs  from 
simpler  ones  is  often  quite  awkward. 

John  Backus,  in  his  1977  Turing  award  lecture,  described  a  well  structured 
and  expressive  functional  programming  style  along  with  a  useful  algebra  of  pro¬ 
grams.  However,  programs  as  functions  are  less  general  than  programs  as  rela¬ 
tions. 

This  work  restricts  logic  programs  to  only  binary  relations  and  shows  how 
to  combine  the  backtracking  of  logic  programming  and  the  higher  order  opera¬ 
tions  of  functional  programming.  This  combination  allows  us  to  define  a  richer 
set  of  program  forming  operations  that  includes  program  inversion.  We  include 
new  optimization  rules  and  point  out  which  FP-Iike  rules  are  invalid  in  a  rela¬ 
tional  context.  As  an  example,  we  show  how  the  algebra  is  used  by  synthesizing, 
from  specifications,  an  efficient  program  for  the  N  queens  problem. 

Introduction 

The  purpose  of  this  paper  is  both  to  show  the  advantages  of  describing  programs  as  binary 
relations,  or  more  accurately  "set  valued  functions”,  and  to  give  laws  relating  program  forming 
operations  whose  arguments  arc  relations.  As  an  exhibition  of  the  power  in  an  algebra  of  binary 
relations  we  synthesize  a  program  for  the  N  queens  problem  from  a  general  specification. 

Warren  (13]  described  how  to  define  higher  order  operators  in  Prolog  and  argued  that 
extending  the  Prolog  language  definition  to  include  them  was,  in  large  part,  unnecessary.  As  he 
says,  they  do  not  extec  .  the  power  of  the  language. 

One  of  Warren’s  examples  is  an  accumulation  operator,  called  iterate.  He  says  that  if  predi¬ 
cate  variables  are  used  in  more  than  small  doses  the  program  becomes  excessively  abstract  and 
therefore  hard  to  understand. 
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iterate( |j,R, Identity  identity ). 
iterate({FirstjRest],R, Identity  .Result) 

itera>.e(Rest, R, Identity  .MidVaiue), 

R(First,MidVa!ue, Result). 

We  can  think  of  the  last  argument  as  the  result  of  accumulating  the  relation  R,  with  identity 
’Identity',  over  a  sequence.  The  first  clause  says  that  the  result  of  accumulating  the  relation  R 
over  an  empty  sequence  is  the  identity.  The  second  clause  breaks  the  sequence  up  into  a  head 
and  a  tail,  accumulates  over  the  tail  of  the  sequence  to  find  a  middle  value,  and  finally  applies  R 
to  the  head  of  the  sequence  and  the  middle  value  to  give  the  result.  Some  examples  of  uses  of 
this  operator  are 

iterate([l,2,3,4],(+),0,10). 

iterate([l,2,3,4],(*),!,24). 

iter3te(|(l,2l,(l,|3,4]],append,[],[l,2,3,4j). 

These  definitions  are  difficult  to  read  partly  because  variables  that  represent  predicates  and 
variables  that  represent  objects  (numbers)  are  mixed  in  at  the  same  level.  This  straightforward 
mixture  of  higher  order  operators  and  Prolog  clauses  leads  to  a  difficulty  of  combining  programs 
defined  with  operators. 

We  argue  that  higher  order  operators  allow  us  to  lift  our  level  of  programming.  We  can 
write  programs  that  operate  in  bigger  chunks.  Operators  act  as  recursion  pattern  templates  and 
save  the  programmer  from  always  relying  on  assertions  based  on  induction  to  define  his  recur¬ 
sions.  We  can  also  give  concise  optimization  rules  between  different  operators.  So  we  are  claim¬ 
ing  that  higher  order  operators  can  play  an  important  role  in  logic  programming.  On  the  other 
hand,  it  is  commendable  that  such  operators  are  so  easily  defined  in  Prolog. 

A  program  forming  operation  is  a  very  useful  tool  for  accomplishing  sophisticated  data 
abstraction.  The  solution  that  is  being  proposed  here  depends  heavily  on  composition  of  relations 
as  a  program  forming  operation  (PFO)  and  allows  other  PFOs  to  name  only  their  relational  argu¬ 
ments.  A  PFO  with  its  arguments  is  a  term  that  is  to  be  rewritten  in  a  forward  direction.  The 
final  result  of  this  term  rewriting  is  also  a  relation.  Backtracking  occurs  at  the  lower,  object  level. 

An  example  of  a  PFO  is  map.  The  term  map(F)  is  a  new  relation  between  inputs  and  out¬ 
puts  (unnamed)  that  is  formed  from  the  relation  F.  If  F  is  a  relation  of  type  A  x  A,  i.e.  it  is  a  set 
of  ordered  pairs  of  type  A,  then  the  relation  map(F)  is  of  type  *A  x  *A,  a  set  of  ordered  pairs  of 
sequences  of  type  A.  Thus  map  is  a  function,  a  PFO,  from  relations  into  relations;  its  type  is 
AxA  ->  *Ax*A. 

We  will  distinguish  two  levels  of  program  definition.  The  first  describes  how  to  relate 
objects  and  allows  definition  of  new  nondeterministic  relations.  The  second  level  allows  definition 
of  new  relations  only  in  terms  of  program  forming  operations  or  previously  defined  relations.  New 
PFOs  can  be  defined  at  the  second  level.  No  new  nondeterminism  can  be  introduced  at  the 
second  level. 

Our  solution  includes  restricting  our  attention  to  binary  relations  and  depending  heavily  on 
an  infix  functor.  for  composition  of  relations. 

In  first  level  definitions  new  relations  are  created  by  asserting  set  memberships.  An  infix 
operator  'in’,  meaning  ’element  oF,  will  represent  a  test  of  membership.  Thus  '(X,Y)  in  R’  can  be 
read  as  ”R  is  applied  to  (is  true  of)  the  pair  (X,Y)”  or  as  ”(X,Y)  is  an  element  of  the  relation  R.” 
This,  along  with  a  representation  of  infix  composition  of  relations  that  doesn't  require  us  to  talk 
about  arguments,  gives  us  a  powerful  and  expressive  notation. 

Therefore  a  first  level  definition  for  map(F)  would  be 


(Il.il)  >a  map(F). 

(jA|B),|C|Dj)  in  map(F)  > 

(A,C)  in  F, 

(B,D)  in  map(F). 

and  redaction  can  be  defined  a s 

({),Z)  in  accum(F,Z). 

(|A|B],D)  in  accum(F,Z) 

((A.Z),X)  in  F, 

(B,D)  in  accum(F,X;. 


A  third  operator  is  ’gen(P,F)’  which  generates  successive  applications  of  F  until  P  holds. 

(X,(Xj)  in  gen(P,F) 

(X.true)  in  P. 

(X,(X|Yj)  in  gen(P,F)  > 

(X, false)  in  P, 

IX, Z)  in  F, 

(Z,Y)  in  gen(P,F). 

For  example,  it  holds  that  (5, (5, 4, 3, 2,1])  in  gen(eql,subl),  ({3, 2,1], [9, 4,1])  in  map(sqr),  and 
((5,4, 3, 2,1], 15)  in  accum(+,0). 

We  interpret  F;G  to  mean  that  F;G  is  true  of  (A,B)  if  F  is  true  of  (A,C)  and  G  is  true  of 
(C4J). 

(A,B)  in  F;G 

(A,C)  in  F, 

(C,B)  in  G. 

As  an  example  we  can  compute  the  sum  of  squares  of  a  few  integers  as 
(]1,2,3,4],Y)  in  map(sqr);accum(+,0). 

The  second  level  definitions  state  an  equivalence  between  relations,  ‘lie  left  hand  side  is  the 
more  abstract  version;  the  right  hand  side  the  more  desirable  from  an  t.  ~ncy  standpoint.  For 
example,  we  may  une  inner  product  of  two  vectors  as 

ip  — >  transpose;map(*);accum(+,0). 

Where  ’transpose’  transposes  a  matrix. 

Second  level  definitions  are  not  restricted  to  just  new  relations  but  can  also  be  used  to  give 
optimization  rules.  For  example  a  loop  merging  optimization  rule  for  map  can  be  stated  as 

msp(F);map{G)  =>  map(F;G). 

with  the  understanding  that  means  ’is  equivalent  to  and  is  to  be  rewritten  as’,  i.e. 

F  =>  G  iff  ((X,Y)  in  F  iff  (X,Y)  in  G). 

Now  we  can  painlessly  add  nev  higher  order  operators  and  write  second  level  definitions 
that  are  like  "logic  program-  in  the  style  of  AFL.”  "because  composition  will  be  from  left  to  right 
it  is  more  suggestive  of  FQL  [6],  The  arguments  given  by  Backus  ]l]  apply  to  programs  of  this 
sort. 

We  write  definitions  at  the  second  level  as  rewrite  rules  without  bringing  ourselves  down  to 
the  object  level.  Even  if  the  objects  in  question  are  nondeterministically  specified  we  can  still 
use  rewrite  rule  theory  to  reason  about  these  program  forming  operations.  Since  they  are  deter¬ 
ministic,  and  if  we  are  careful,  the  rules  can  be  der  ned  to  obey  ChurcH-Rosser  properties.  As 
another  example,  we  may  assert  a  definition  for  the  inner  product  of  two  rectors,  i.e. 
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ip  =>  transpose;map{*);acr!m(-t-,0). 

Where  transpose  transposes  a  matrix.  The  symbol  ’ip’  represents  aa  abstraction  of  the  computa¬ 
tion  ’transpose;map{*);accum(+,0)’.  Aa  example  of  the  reasoning  that  would  go  along  with  such 
a  definition  is 

(III, 2  3j,|4,5,6]j,Y)  in  ip 

=>  (({1,2,31,14, 5, 6)|,Y)  in  transpose;map(*);accum(-r,0). 

=>  (|(1,4],{2,5],[3,6]],Y)  in  raap(*);accum(+,0). 

~>  ([4,10,18j,  if)  in  ac^umj-f  ,0).  • 

=>  Y=32. 

Selectors,  Constructors  and  Nottdetcrndnkm 

A  convenient  way  of  sharing  aa  inpat  to  more  than  one  relation  is  with  Backus'  constructor 
functional.  We  read  [F|G]  as  a  sequence  ol  relations  whose  head  is  F  and  whose  tail  is  G.  The 
relation  [FIG]  is  true  of  an  input/ontput  pair  if  F  is  true  of  the  input  and  the  head  of  the  output 
and  G  is  true  of  the  input  and  the  tail  of  the  output.  More  formally, 

(X,H)  in  H. 

(X,|U|V))  in  fFjGj 
(X,U)  in  F, 

(X,V)  in  G. 

here  it  is  understood  that  since  [U|V]  is  in  the  object  position  then  U  and  V  represent  components 
of  a  sequence  of  objects.  For  example,  (3, {9, 27])  in  [sqr.cube]. 

In  like  fashion  we  define  selectors  that  disassemble  what  the  constructors  build.  We  capture 
the  Kth  component  ol  a  sequence  with  #K.  The  relation,  tl,  gives  ns  the  rest  of  a  sequence.  The 
.identity  is  id. 

(X,X)  in  id. 

((X|Y],X)  in  #1. 

((X|Y],Z)  in  #K 
.1  is  K-I, 

(Y,Z)in  #J. 

(|XlY],Y)intl. 

We  try  to  localize  instances  of  nondeterminism  to  easily  transform  programs  that,  do  net  use 
rondetermiuism.  In  this  paper  we  describe  all  nondeterminism  with  a  'select*  relation  A 
sequence  is  the  input  to  select.  The  value  returned  from  it  is  &  pain  one  (any)  element  oi  the 
sequence  and  a  subsequence  of  all  other  elements.  We  can  describe  tbs  set  of  ordered  pairs  as 

([X|Y],  [X,Y])  in  select. 

(|X|Y],  1U,{X|V]J)  in  select:- 

(Y,  [U,V])  -a  select. 

Some  sample  pairs  in  the  relation  select  arc-  ({1,2, 3], (1,(2, 3]]),  ({1,2,3], [2.(1, 3]])  and 
((1,2,31,(3,{1,21J).  Yhct,  »  are  all  the  ways  of  selecting  an  element  from  the  sequence  [1,2,3]. 

Gvards 

In  our  ref  -.esentati'iou  of  computations  as  binary  relations  we  often  must  package  a  collection 
of  inputs  into  a  single  structured  input.  for  example,  the  Prolog  clause  append(A,?.<,C)  must 
be  written  as  ((A,B],C)  in  append.  Likewise,  we  wil1  often  need  to  extend  predicates  of  single 
arguments  so  that  they  can  return  Boolean  values,  for  example  (3, true)  in  cdd.  These  extensions 
are  important  because  we  can  use  higher  order  operators  with  Booier  valued  relations,  e.g. 


every(P)  — >  map(P);accum(and,true). 

Thus  ’every(odd)’  can  check  every  element  of  a  sequence  for  oddness,  e.g.  ((1,3,17], true)  in 
every  (odd). 

A  guard  is  any  Boolean  valued  relation.  Backtracking  choices  are  created  with  ’select’  and 
pruned  with  guards.  These  definitions  are  patterned  after  Dijkstra’s  guarded  command  language 
|5j  except  that  our  nondetermism  is  uncommitted.  In  (P  ->  F /,  P  is  a  precondition  for  the  input 
to  F  and  in  (F  <-  P),  P  is  a  postcondition  for  the  values  returned  from  F.  More  precisely, 

(A,B)  in  (P  ->  F) 

(A, true)  in  P 
(A,B)  in  F. 

Postconditions  are  appropriate  for  the  ’generate  and  test’  paradigm.  This  form  tests  the 
oatput  of  a  relation  with  a  guard. 

(A, 6)  in  (F  <-  P):- 
(a,B)  in  F, 

(B.true)  in  P. 

The  first  clause,  (A,B)  in  F,  may  generate  several  feasible  solutions  for  B,  only  a  '  c  which 
may  satisfy  P  For  example  an  inefficient  way  to  sort  a  sequence  is  to  permute  the  i-  q»  mce  and 
then  check  that  it  is  ordered. 

sort  =>  (perm  <-  ordered). 

Inversions  with  respect  to  composition 

An  unusual  program  forming  operation,  one  that  can  play  a  major  it  is  u  program  transfor¬ 
mations,  is  the  inverse.  The  program  inv(F)  is  formed  from  F  by  running  £  backwards, i.e.  with 
known  output  but  unknown  input.  The  following  is  tz  obvious  definition  but  is  often  useless  in  a 
sytem  where  goals  are  solved  in  depth  first  order. 

(X,Y)  in  inv(F) 

(Y,X)  in  F. 

For  example  the  expression  ((],Y)  in  inv(accum(append,Q))  cannot  be  solved  for  Y  with  a 
depthfirst  search. 

However,  the  concept  turns  out  to  be  useful.  It  allows  us  to  retain  a  left  to  right  order  of 
computation  and  make  effective  use  of  previously  defined  relations.  By  assuming  that  left  hand 
arguments  (inputs)  are  known  and  that  outputs  are  to  be  determined,  we  can  control  the  extent  of 
our  searches.  This  is  done  by  ordering  our  subgoais  so  that  the  computation  tree  is  not  bushy  but 
stringy.  Unfortunately,  the  above  definition  for  inverse  is  not  an  efficient  computation  rule  and 
may  even  lead  to  a  nonterminating  computation. 

A  logic  programming  system  with  the  completeness  property  (all  clauses  with  a  solution  can 
be  solved)  cannot  carry  out  all  searches  in  depth  first  order.  However,  a  ♦otaiiy  breadth  first 
search  is  overkill  because,  although  such  a  metL ‘  **  complete,  it  is  ineffici  nt.  We  want  to  do 
most  searches,  those  that  are  assured  of  terminate  in  depth  first  fashion  aT  d  c?ny  out  the  rest 
breadth  first.  The  problems  caused  by  the  intension  of  inverses  with  the  depth  first  solution 
strategy  are  being  examined  in  another  paper  (2j.  It  includes  a  collection  of  operator  directed 
transformation  rules  that  ’solve’  the  program  for  an  inverse  program. 

Limitations  of  the  algebra 

If  true  functions  are  evaluated  deterministically  there  is  still  a  practical  concern.  It  is  often 
difficult  to  prove  termination  even  for  deterministic  computations.  Our  approach  is  to  interpret 
the  constructor  function  under  a  lazy,  rather  than  strict,  semantics.  In  fact,  lazy  evaluation  is 


required  in  order  to  talk  about  potentially  unbounded  structures  which  arise  when  interacting 
with  external  physical  devices. 

ihe  following  logic  program  fails  if  g(X,V)  fail'.  But  failure  may  not  be  the  desired 
behavior  if  jUjV]  represents  a  potentially  unbounded  sequence  of  which  we  only  need  the  head. 

h(:.,U) f(X,U),g(X,V). 

Equivalently,  the  rule 

[F,G];#1  =>  F. 

dots,  net  hold  under  a  strict  semantics  for  at  least  two  reasons.  As  stated,  if  G  fails  either  by 
returning  an  undefined  value  or  by  not  terminating  then  the  left  band  expression  fails  whereas  the 
right  hand  side  may  still  succeed. 

A  less  obvious  discrepancy  involves  domain  elements  with  uninstantiated  variables.  If,  for 
example,  G  succeeds  only  when  the  input  is  guarded,  then  G  may  instantiate  an  undefined  vari¬ 
able  to  that  value.  Therefore  F  in  the  left  hand  expression  will  be  solved  under  the  assumption 
that  G  succeeds  and  would  be  more  defined  than  what  the  right  hand  side  would  find  as  a  solu¬ 
tion. 

Mu;c  Lfe-ilically,  consider  the  following  relation 
|id,(eq4  ->  id)];#l. 

It  is  not  eq-vvaient  to  ’id.’  The  difference  is  noticed  if  the  relation  solved  with  both  unspecified 
domain  and  range  elements. 

(X,Y)  in  [id,(eq4  ->  id)j;#l 
=> 

(X,U)  in  id, 

(X,V)  in  (eq4  ->  id), 

|U,V|=*Z,  #1:(Z,Y). 

=> 

X=U, 

(X,V)  in  (eq4  ->  id), 

|U,V]=Z, 

U=Y. 

==> 

X~Y, 

(X.trne)  in  eq4, 

X=V. 

=> 

X— Y, 

(4, true)  in  eq4, 

X=4. 

=> 

X=4, 

Y=4. 

This  solution  is  more  specific  than  the  solution  c? 

(X,Y)  in  id. 


This  point  must  be  considered  when  optimizing  programs  whose  domains  include  ’difference 
lists.’  The  term  difference  list  is  often  used  to  refer  to  an  incompletely  defined  data  structure  that 
has  become  increasingly  instantiated  during  a  computation.  However,  the  method  could  also 
apply  to  data  structures  ether  than  lists  suck  as  trees.  Operations  such  as  the  append  of  two 
difference  lists  can  be  performed  in  constant  time  because  they  can  involve  just  the  instantiation 
of  variables  instead  of  copying  of  entire  structures.  Another  example  is  lie  process  of 
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maintaining  a  queue;  insertion  of  a  new  element  can  be  done  in  constant  time,  if  access  to  the 
uninstantiated  variable  at  the  end  of  the  queue  is  shared  and  immediate. 

Although  there  are  methods  for  accessing  and  destructively  updating  recursive  data  struc¬ 
tures  ([9j)  in  functional  languages,  they  Involve  more  complex  concerns  and  are  more  restrictive. 
In  particular  the  paths  of  (9j  do  not  allow  sharing.  Therefore  only  trees  and  no*  DAGs  or  graphs 
can  be  treated. 

We  define  conditions  under  which  we  can  apply  these  rules.  One  constraint  that  would 
assure  applicability  is  to  require  that  every  domain  element  be  completely  defined  and  that  G  ter¬ 
minate  on  this  value.  This  requirement  is  so  strict  that  it  rules  out  both  difference  lists  and 
streams  and  is  thus  unsatisfactory. 

Our  solution  is  to  isolate  on  a  particular  kind  of  difference  list.  Instead  of  creating  two 
values  with  the  constructor  functional,  we  return  a  pair  of  suspensions,  i.e.  promises  to  compute 
the  values.  We  will  read  the  clause,  ’X  suchthat  P’  as  ’a  value  X  that  has  property  P.'  So  we 
replace  our  previous  definition  of  the  constructor  with 

(X,({U  suchthat  (X,U)  in  F)1(V  suchthat  (X,V)  in  G)]) 
in  [F|Gj. 

(X,|I)  in  (j. 

and  replace  our  previous  definition  of  composition  with 
(X,Y)  in  F;G 

(Z  suchthat  (X,Z)  in  F),Y)  in  G. 

We  must  also  include  rules  that  recognize  suspensions  if  their  values  are  needed. 

(X  suchthat  P,Y)  in  F 
var(X), 

P, 

(X,Y)  in  F. 

(X  suchthat  P,Y)  in  F  > 
nonvar(X), 

(X,Y)  in  F. 


These  are  equivalent  to  the  rules  that  describe  lazy  evaluation’  or  ’call  by  need’  [12],  [8],  [7] 
although  our  definition  is  much  simpler.  Voillemin  has  shown  that,  as  an  argument  evaluation 
rule,  this  mechanism  is  optimal.  If  values  are  never  requested  from  subcomputations,  then  they 
never  get  computed.  Lazy  evaluation  allows  a  more  expressive  style  of  programming  because  ter¬ 
minating  conditions  can  often  be  ignored.  Cue  can  conceptually  build  and  manipulate  infiniu 
structures  or  streams. 

The  greatest  disadvantage  is  that  there  is  a  constant  overhead  associated  with  making  every 
structure  a  strea"  .  The  work  of  Mycroft  [11]  and  Hudak  [10]  might  be  adjusted  to  optimize  some 
lazy  computations  into  equivalent  strict  computations.  The  greatest  advantage  that  lazy  evalua¬ 
tion  holds  for  us  is  that  algebraic  laws  are  valid  without  other  conditions  such  as  success  of 
irrelevant  subcomputations.  The  computation  carried  out  with  lazily  evaluated  arguments  agrees 
with  wbat  a  mathematician  would  e*  pect  if  he  did  the  algebraic  manipulation  by  hand.  Thus  the 
rule 


[F,G];#1  =>  F 

holds  independently  of  whether  G  succeeds  or  fails.  This  is  proved  by  showing  that  [F,G];#1  is  a 
sub.elation  of  G  and  vice  versa. 

The  following  is  a  simple  example  of  stream  oriented  programming.  It  is  a  program  that 
returns  the  infinite  list  of  powers  of  two. 


powersoftwo  —  >  Ql;doubleall. 
doubleall  — >  {id|times2;doubleal]]. 
times2  =>  [id,©2];(*). 

The  relation  Q1  is  a  constant  function.  It  ignores  all  inputs  and  returns  1  as  an  output.  The  con* 
structor  in  ’doubleall’  must  be  evaluated  lazily. 

There  are  concerns  to  be  voiced  about  interactions  with  backtracking  and  streams.  A  pro* 
gram  that  returns  an  infinite  structure  cannot  be  inverted.  We  must  be  careful  about  construct¬ 
ing  streams  because  we  cannot  backtrack  over  a  partially  built  structure  that  h?s,  for  example, 
already  been  printed.  This  point  is  relevant  to  the  N  queens  problem  that  we  consider  in  a  later 
section. 

Toward*  an  algebra  of  cendetermlnlstle  program* 

Before  we  look  at  the  example  we  first  consider  the  rules  we  will  be  using.  Some  of  the  rules 
from  functional  programming  do  not  hold  when  combining  relations.  In  particular,  expressions 
that  contain  nondeterminism  cannot  be  replicated  or  removed  without  care.  We  cannot,  for 
example,  distribute  a  nondcterministic  expression  whose  results  are  shared  ([4]).  The  rule  from 
Backus'  FP  algebra  [l]  that  distributes  shared  input  to  a  constructor  functional  holds  only  if  the 
program  whose  result  is  shared  is  a  function,  i.e. 

F;(G,H1  =>  (F;G,  F;H] 

deterministic(F). 

is  valid  only  if  F  is  deterministic.  Also 
lF,Gl;#i  =>  F 

deterministic(G). 

requires  that  the  number  of  replications  of  (F,G]  be  determined  only  by  the  nondeterminism  in  F, 
not  in  G.  This  is  because  our  relations  are  built  on  a  multiset  model  instead  of  on  sets.  In  gen¬ 
eral,'  we  must  be  careful  about  applying  any  rules  that  change  the  number  of  occurrences  of  an 
expression. 

Finally,  if  we  maintain  a  lazy  constructor  semantics  we  cannot  propagate  guards  into  the 
component  of  a  constructor.  For  e  pie, 

(|F,G)  <-  #1;P) 
is  a  subrelation  of 

I(F  <-P),G|. 

Some  of  the  following  rules  will  be  used  in  the  derivation  of  the  N  queens  program. 

(F  <-  G;P)  ~>  (F;G  <-  P);inv(G) 
if  G;inv(G)  =>  id. 

xaap(F),cvery(P)  **>  every(F;P). 

(F;G  <-  P)  =>  (F  <-  G;P);G 

if  G  is  deterministic 

gen(P,F);tl  =>  F;gcn(P,F). 

gen(P,F);last  =>  ioopuutil(P,F). 

(gen(P,F)  <-  every (Q))  —  >  (gen(P,(F  <-  Q))  <-  defined) 


The  last  rule  shows  one  problematical  interaction  between  streams  and  backtracking.  The 
expression  gen(P,(l’‘  <-  Q))  generates,  until  P  becomes  true,  a  stream  whose  elements  satisfy  Q. 
If  somewhere  later  in  the  stream  Q  fails,  then  the  entire  stream  becomes  invalid  as  an  answer.  In 
operational  terms  we  should  "back  up  the  line  printer,  erasing  invalid  characters.” 

We  avoid  this  problem  by  adding  the  postcondition  ’defined’  to  strictify  the  stream.  There* 
fore  the  stream  is  not  passed  as  valid  until  the  entire  structure  has  been  created  and  checked. 

The  N  Queens  Problem 

The  problem  is  tc  place  N  queens,  on  an  N  'by  N  chessboard,  in  a  manner  such  that  no 
queen  is  attacking  any  other.  This  problem  is  a  common  example  of  backtracking,  however  the  N 
queens  problem  is  simpler  to  solve  if  we  don’t  think  of  it  as  a  traditional  backtracking  problem. 
Instead,  we  just  consider  applying  a  restricting  condition  to  a  set  of  possible  answers.  In  other 
words,  we  view  it  as  just  another  instance  of  the  generate  aid  test  paradigm. 

We  start  with  a  simple,  understandable  version  of  the  N  queens  problem  that  generates 
every  possible  board  position  and  then  tests  each  for  validity.  Thror  jh  the  successive  application 
of  rewrite  rules,  we  transform  this  program  to  a  more  efficient  one.  However,  we  do  not  prove  the 
rules  in  this  pape:-. 

The  N  queens  problem  is  first  solved  by  generating  a  feasible  set  and  then  testing  each  to 
see  if  all  queens  are  safe.  The  original  feasible  set  is  just  the  set  of  all  permutations  of  placements 
of  the  N  queens  in  N  columns.  So  initially  there  are  N  factorial  elements  in  the  set.  Ibe  relation 
that  generates  this  set  is  just 

h>ta;perm. 

One  elemental  pair  in  thb  relation  is  (4, [3, 2, 1,4]).  This  is  a  placement  of  queens  by 
columns:  first  column,  third  row;  second  column,  second  row  and  so  on.  This  is  only  a  tentative 
solution  and  not  a  solution.  A  solution  is  (4,[2,4,l,3j). 

We  use  a  random  selector  relation  that  picks  an  item  from  within  a  stque  .ee.  This  is  the 
source  of  ail  nondetenninbra  in  the  N  queens  problem.  A  logic  programming  definition  for  select 
would  be 

(p£jY],|X,Yj)  in  select. 

([X|Y],(U,|X|V]])  in  select:* 

(Y,[U,VJ)  in  select. 

These  permutations  are  screened  according  to  the  condition  that  there  must  be  no  diagonal 
conflicts.  The  representation  makes  it  impossible  to  have  row  or  column  conflicts 

Our  specification  of  the  N  queens  program  is  reminiscent  of  Clark  and  Darlington’s 
specification  of  sort(3].  That  specification  generates  all  permutations  of  the  sequence  and  accepts 
only  those  that  are  ordered.  Several  efficient  sort  algorithms  are  synthesized  from  thb 
specification.  Correspondingly,  we  generate  all  permutations  of  1..N  and  accept  only  those  boards 
where  all  queens  are  safe.  Our  straight  forward  but  inefficient  program  for  N  queens  b  as  follows. 

nqueens  «>  fiotajperm  <•  nonesunediag). 


where 


;  *  i  .  -  x  ,  --•* 


-«  -*  guyuirjyujya^.wiiUMj^’j.fliiu^ 


iota  — >  gen(eql,subl);rev. 

perm  =>  j|j,idj; 

gen(#2;nuli,  (#l,#2;select}; 

|(#2;#1|#1],  #2;#2l); 
tl;map(#l);last. 

nonesamediag  =>  splitoff; 

every(noconflict). 

noconflict  =>  J#l,tl]; 

fdistl;map(absdiff),  #2;len;k>ta); 
trans;eveiy(ne). 

splitoff  =>  gen(tl;null,tl). 

every(P)  — >  map{P);aceum(and,true). 

Our  objective  is  rewrite  this  inefficient,  but  straightforward,  definition  into  a  more  efficient, 
but  possibly  less  understandable,  program. 

Beiore  w c  continue  along  these  lines  there  is  a  particular  kind  of  sequence  of  sequences  that 
needs  attention.  We  define  a  fading  permutation  as  a  sequence  with  these  properties: 

1.  The  head  of  the  sequence  is  a  permutation. 

2.  The  i+lth  element  of  a  sequence  is  the  tail 
of  the  ith  element. 

3.  The  last  element  is  itself  a  sequence  but 
with  one  element. 

Given  a  permutation  the  following  program  is  one  way  to  create  a  fading  permutation. 
gen(tl;null,  tl). 

We  must  recognize  the  fact  that  the  program 
#l;gen(tl;null,  tl) 

is  an  identity  on  fading  permutations.  We  return  to  the  original  problem. 

We  can  rewrite  nqueens  as 

nqueens  =*>  (iota;perm 
<- 

gen(tl;null,tl);every(noconflict)). 

Noting  that  #1  is  a  right  inverse  for  splitoff  we  have 

nqueens  — >  (iota;perm;gen(U;u«i!l,uj 
<- 

every(noconflict)); 

#1. 

We  concentrate  oui  attention  on  perm;splitoff  and  substitute  rev;#l  for  Imt. 
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perm;gen(tl;nuli,tl) 


=  >  [0,idj;gen(#2;nnU,  |#l,#2;seleuj; 

li#2;#l|#l|,  #2;#2|); 

tl;map(#l);rev;#l; 

gen(tl;null,tl). 

=*>  |fl,id|;gen(#2;anll,  (#1,  #2;select); 

j{#2;#l|#l],  #2;#2j); 
tl;map(#l);rev. 

as  long  as  we  know  that  this  final  expression  generates  fading  permutations. 

To  show  that  this  generates  fading  permutations  we  note  that  the  first  element  of  a  result¬ 
ing  sequence  (after  being  reversed,  i.e.  originally  the  last)  is  a  permutation.  This  is  because  the 
first  element  of  this  expression  is  equivalent  to  the  definition  of  perm.  Next  we  note  that  succes¬ 
sive  elements  are  sequences  that  differ  only  by  a  missing  first  element.  Th»  i  the  i  f  1th  element  is 
the  tail  of  the  ith  element.  Finally  wc  note  that  the  last  element  of  the  fading  permutation  has 
only  one  element.  From  these  three  facts  we  claim  that  our  use  of  the  simplification  rule  is 
justified. 

Now  we  note  that  since  ’and’  is  associative  and  commutative  we  can  say 
rev;every(P)  *=>  every(P). 

A*ao  we  know  that 

(F;G  <-  P)  =>  (F  <-  G;P);G. 

where  G  is  deterministic.  Using  these  two  rules  we  now  have  the  expression 

nqueens  => 

(iota;{{J,idJ; 

gen(#2;null,  {#1,  #2aelect|; 

i[#2;#l|#ll,#2;#2j);tl 

<- 

map(#  1  );every  (noconflict)); 
map(#l);last. 

We  can  move  tl  out  of  the  way  with  the  following  rule 
gec(P,F);tl  «>  F;gen(P,F) 

and  we  can  merge  map(#l)  with  every(noconflkt)  with 

map(F);every(°)  =>  every (F;P). 

Finally  the  constraint  that  for  every  new  queen  there  be  no  conflict  can  be  propagated  into 
the  gen  loop  with  the  following  rule 


k 

I 

i 

0 


gen(P,F)  <-  every(Q)  =>  (gen(P,(F  <-  Q))  <-  defined). 


So  we  now  have  the  expression 


nqueens  — > 
iota; 

IB.M1; 

j#l,#2;select); 

|(#2;#ll#ll,#2;#C];  • 

gen(#2;n«U,  ((#!,  #2;selectj; 

l!#2;#l|#l),  #2;#2) 

<  -  #l;coconflict)); 

map(#l); 

last. 


or 


nqueens  => 

[{]  ,iota;select] ; 

[i#2;#l|#l|,#2;#2]; 
loopuntil(#2;null,  ({#1,  #2;select|; 

II#2;#1|#1),  #2;#2j 

<-  #l;noccnflict)); 


#1- 

This  final  program  is  more  obscure  but  much  more  efficient  than  the  original. 


Further  Work 

A  system  that  can  do  these  transformations  automatically  would  have  to  include  a  fairly 
large  collection  of  rules,  although  the  operator  direction  helps  to  reduce  the  number.  We  are 
working  on  rules  that  can  derive  inverses  of  binary  relations.  They  are  important  for  this  work 
and  should  be  more  generally  useful  to  other  kinds  of  program  tranbformations  uuch  as  *he 
Burstall-Darlington  unfold/fold  method. 

Even  the  synthesis  shown  in  this  paper  has  some  weak  L'pots  that  need  filling.  The  mo6t 
direct  definition  of  ’perm’ 

perm  => 

sekr>'; 

gen(..  2;null,  #2;select); 

map(#l). 

does  not  easily  lead  to  the  program  given  here  although  this  does  not  appear  to  be  a  serious  prob¬ 
lem.  We  also  should  be  able  to  transform  the  definition  of  ’noconltict’  into  a  more  efficient  form 
such  as 


noconflict  => 

[3 true,  ©1,  #1,  tlj; 
loopu»til(#';d;nuU, 

H#l,|#2,I#3,#4;#ll;absdiffj;ne);and> 

#2;addl, 

#3, 

#4;tl|); 

#1. 

Transforming  ’noconflict’  into  this  program  seems  to  require  a  different  collection  of  rules  than 
given  in  this  paper. 


Discussion  of  Application* 

As  the  .inds  of  programs  we  write  become  more  complex  and  more  ambitious,  they  become 
more  difficult  to  write  efficiently.  In  particular,  when  separate  programs  are  combined,  their  loops 
should  often  be  combined.  The  gap  between  efficiency  and  generality  is  becoming  more  difficult 
to  bridge  and  the  need  for  program  transformation  systems  is  becoming  apparent.  This  work 
views  such  a  program  transformation  system  as  a  kind  of  computer  algebra  package  that  manipu¬ 
lates  programs  as  expressions. 

A  second  important  application  includes  the  formal  modeling  of  the  nondeterrainism 
inherent  in  message  passing  in  networks.  We  hope- that  properties  of  a  model  can  be  proved  by 
algebraic  simplification  and  transformation  in  place  of  a  simulation. 

Conclusions 

We  have  exhibited  an  operator  directed  method  for  reasoning  about  nondeterministic  pro¬ 
grams.  The  notation  is  a  combined  form  of  functional  programming  and  logic  programming. 
This  method  does  not  depend  on  recursion  and  its  associated  induction  arguments  but  instead 
represents  patterns  of  recursion  with  program  forming  operations  and  uses  rules  relating  them. 

The  most  important  result  in  this  paper  i6  the  synthesis,  from  a  formal  specification,  of  an 
efficient  backtracking  program  for  the  N  queens  problem.  We  used  the  method  of  propagation  of 
constraints  so  that  nonsolutions  are  rejected  before  they  are  constructed.  We  showed  that 
streams  and  backtracking  may  interact  in  unexpected  ways  and  pointed  to  the  occasional  need  to 
’strictify’  streams. 
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ABSTRACT 

Error  control  codes  having  modest  error  correction  capabilities  and  additional  error 
detection  features  often  are  used  beyond  their  correction  abilities  to  force  retransmission  of 
messages  under  severe  channel  conditions.  High  noise  or  interference  levels  can  cause  a  significant 
fraction  of  messages  to  experience  multiple  transmissions  in  order  to  provide  successful  delivery. 

To  improve  the  reliability  of  message  delivery,  a  second  level  of  error  control  is  examined. 
A  aonbinary  Reed-Solomon  (RS)  code  treats  each  detected  character  error  as  an  erasure:  the 
location  of  the  error  is  known,  but  the  identity  of  the  transmitted  symbol  is  net.  The  decoder 
then  fills  in  symbols  which  the  channel  has  "erased.” 

The  performance  of  such  an  error  control  scheme  in  a  typical  application  is  studied,  and 
the  assumed  scheme  is  shown  to  provide  significant  improvement  for  a  modest  change. 


t.  INTRODUCTION 


Concern  about  the  numbers  of  message  retransmissions  often  required  by  the  occurrence  of 
detectable  but  not  concciabic  error  patterns  on  degraded  communication  channels  has  prompted 
corrective  suggestions  including  transmitting  each  message  twice  and  eliminate  acknowledgement 
messages.  This  suggestion  probably  will  not  play  well  in  Peoria  because  the  military  arc  rot  likely  to 
give  up  the  unambiguous  report  which  they  feel  is  provided  by  receipt  of  ACK. 

Typically,  an  error  correcting  code  with  modest  error  correction  capabilities  and  the  ability  to 
detect  somewhat  more  severe  error  conditions  is  used  to  correct  some  channel-induced  errors  and  to 
indicate  the  existence  of  others.  When  such  an  error  detection  condition  occurs,  the  datalink  protocol 
[TANE81]  suppresses  acknowledgement  of  receipt  of  the  message.  After  a  timeout  period,  the  source 
of  the  unacknowledged  message  will  retransmit  it  High  noise  or  interfererce  channels  can  cause  a 
significant  fraction  of  all  messages  to  require  multiple  transmissions  in  order  to  achieve  successful 
receipt. 

To  improve  the  reliability  of  message  reception,  a  second  level  of  error  detection  and  correction  is 
examined.  A  noribinary  Recd-Solomon  (RS)  code  can  treat  each  detected  character  enor  as  an  erasure: 
the  location  of  the  error  is  known,  but  the  identity  of  the  transmitted  symbol  is  mt  Decoders  for  error 
correcting  codes  not  only  correct  received  symbol  errors  but  also  fill  in  symbols  which  the  channel  has 
erased. 

In  what  follows,  the  performance  of  error  detection  and  correction  in  a  typical  system  is 
determined  for  a  representative  communication  channel,  and  augmentation  of  the  assumed  scheme  is 
shown  to  provide  significant  improvement  for  a  modest  change. 


n.  ERROR  DEI7X  nON  AND  CORRECTION  IN 
MESSAGE  COMMUNICATIONS 


A.  INTRODUCTION 

This  note  is  concerned  with  the  use  of  algebraic  or  block  error  correcting  ^  to  improve 
message  reception  in  message  handling  systems  using  very  noisy  communication  channels.  The 
performance  of  such  codes  is  outlined  in  this  section. 

Figure  1  shows  a  rondel  of  the  process  by  which  information  from  a  binary  source  (e.g^  a 
message  device)  is  conveyed  to  a  destination  in  an  accurate  and  timely  m«nn^  over  t 
communication  channel. 


Figure  1.  Model  of  Linear  Block  Cbdes 

To  each  block  of  k  information  bits  produced:  by  the  source,  the  encoder  appends  (n  -  k) 
redundant  bits,  each  computed  as  a  modulo-2  sum  (linear  combination)  of  at  least  some  of  the 
information  digits.  That  is,  they  transmit  no  additional  information  but  represent  a  form  of  controlled 
redundancy  which  is  exploited  at  the  decoder  in  order  to  recover  the  information  actually  transmitted.  It 
is  said  that  each  redundant  bit  is  a  ’parity  check’  on  the  information  bits  which  constitute  its  sum. 
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Tnc  sjucture  th-i  pmdjccd  is  called  an  (n,k)  linear  or  block  code.  (Mathematically,  a  linear 
fc  nek  code  ^  a  k-cLmcnsional  sub  .parse  of  the  vector  spare  of  n-tuples  over  a  finite  field  of  q  elements, 
where  q  Is  an  integer  power  of  a  prime  number  and,  t.i  th:s  paper,  is  2.)IPETE72].  The  value  of  k  is 
known  as  the  dimension  of  the  code:  n  is  its  length. 

Rules  for  selecting  the  subsets  of  information  digits  to  be  checked  by  a  parity  digit  arc  constructed 
so  as  to  make  the  codewords  pairwise  as  diiTcrcm  as  possible.  If  they  are  as  different  as  possible, 
correct  decoding  can  often  be  unambiguously  accomplished  by  selecting  as  the  transmitted  codeword 
that  which  is  "nearest*  to  the  received  n-tuple.  Thus,  while  encoding  produces  a  unique  n-tuplc  for 
every  block  of  k  information  digits,  decoding  must  map  many  n*tuples  into  one  k-tuple. 

B.  THE  BINARY  SYMMETRIC  CHANNEL 

The.  channel  model  is  shown  in  Figure  11.  The  random  binary  symbol  generator  produces  a 
binary  ONE  with  probability  p  and  a  ZERO  with  probability  (1  -  p).  Information  bits  from  the  source 
and  noise  bits  from  the  random  symbol  generator  arc  added  modulo-2  in  the  channel  to  produce  output 
bits.  Thus,  an  information  bit  will  be  invened  if  and  only  if  the  noise  bit  is  a  ONE,  and  we  say  that  the 
cnannel  bi:  error  probability  is  p.  It  is  always  assumed  that  p  <  0l5  because  if  p  —  0.5  the  charnel  can 
transmit  no  it  formation  [GALL68]  and  if  p  >  03  symbol  redefinition  will  make  p  <  03. 


RRH00H  BINARY  SYMBOL? 
CEHERflTOR  ( 


INPUT  - 


■  OUTPUT 


Figure  2.1  The  Binary  Symmetric  Channel 

This  channel  is  known  as  the  binary  symmetric  channel  with  transition  probability  p,  BSOfp),  and 
its  behavior  is  reprr^cn.ed  by  the  state  transition  diagram  of  Figure  2JL 


Figure  2.2  State  Transitions  in  the  Binary  Symmetric  Channel 

The  BSC  provides  a  convenient  vehicle  for  the  comparison  of  error  detection  and  correction  techniques. 
No  claim  is  made  as  to  its  fidelity  in  representing  actual  channels  although  it  is  a  valid  model  for  FM 
radios  carrying  binary  data  over  long  distances  in  the  absence  of  external  interfi  ,-jrce. 

c.  HAMMING  CODES  AND  SHORTENED  HAMMING  CODES. 

One  aim  of  code  design  is  to  make  the  codewords  as  different  as  possible  so  that,  when  corrupted 
by  channel  noire,  a  received  word  tends  i  be  nearer  the  word  transmitted  than  to  any  other  word.  For 
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transmission  over  ’lSC(p),  code  word  difference  is  expressed  as  Hamming  distance:  the  number  of 
positions  in  which  the  two  words  differ.  It  can  be  shown  [PETE721  that  the  minimum  distance  d 
between  two  words  over  a  given  code  guarantees  that  the  code  can  correct  any  error  pattern  of  t  or 
fewer  errors  provided 

t<[(d-l)/2] 


where  the  notation  indicates  the  integer  part  of  the  argument.  For  linear  block  codes,  the  minimum 
distance  between  two  codewords  is  equal  to  the  Hamming  weight  (number  of  non-zero  positions)  of  the 
minimum  weight,  non-zero  codeword. 

Detailed  structure  of  the  codewords  can  be  encapsulated  in  a  (k  x  n)  code  generator  matrix,  G.  A 
binary  k-tuple  is  encoded  by  postmuitiplying  it  by  G  to  pioducc  a  length  n  codeword.  Hence,  all 
codewords  are  linear  combinations  of  the  rows  of  G.  Each  linear  code 
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(2-0 


also  has  an  associated  parity  check  matrix,  H,  with  the  property  that  the  product  of  any  codeword  with 
the  transpose  of  H  Gives  zero: 

vHt  -  0  (2-2) 

The  relation  between  H  and  G  can  be  seen  by  ordering  :the  columns  of  H  so  that  it  assumes  the  form: 

H-tQlla-kl  (2-3) 

The  orthogonality  property  of  (2-2)  then  causes  the  code  generator  to  have  the  form  [LIN&83] 

G-[ljQT]  (2-4) 


where  Ij  is  the  jth  order  identity  matrix  and  T  indicates  matrix  transposiuon. 

Hamming  codes  IHAMM50]  arc  block  codes  having  the  capability  to  correct  exactly  one  error  per 
codeword.  If  an  additional  parity  check  is  computed  on  the  entire  codeword,  the  Hamming  decoder  can 
deiect  any  combination  of  two  errors  in  a  received  word  as  well. 

Codewords  have  length  and  dimension  as  shown  in  (2-5).  All  the  run-zero  m-tuplcs  are  the 
columns  of  the  code's  parity  check  matrix. 

n  —  2m  —  1 

k~tt  —  ra  (2-5) 


Thus,  there  is  one  Hamming  code  for  each  value  of  ra. 

The  (15,11)  single  error  correcting  Hamming  code  has  the  parity  check  matrix,  (2-6),  produced 
by  writing  as  columns  all  the  binary  4-tuples,  ordered  numerically. 


H- 


fOOOOOOOllllllll 

000111100001111 

011001100110011 

101010101010101 


(2-6) 
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To  obtain  the  generator  matrix  of  this  code,  the  columns  of  the  parity  check  matrix  ere  jtoideiud 

as  in  (2-3): 


HI  - 


loooooooinmi 

010001110001111 

00101C110110011 

Loooi  1101 1010101. 


The  generator  matrix,  then,  can  be  written  according  to  (2-4)  as: 

10000000000001 11 
0100000000001011 
0010000000001101 
0001000000001110 
0000100000010011 

G~  0000010000010101  (2-8) 

0000001000010110 
0000000100011001 
0000000010011010 
0000000001011100 
L00000000001 11111 

The  16th  column,  an  even  parity  check  on  the  enure  row,  has  been  added  for  double  error  detection. 

Any  error  correcting  block  code  of  length  n  can  be  shortened  to  length  (n  -  s)  by  setting  to  zero  s 
positions  in  the  information  vector.  If  the  first  s  positions  arc  those  set  to  zero,  then  all  codewords  will 
begin  with  s  zeros  which  need  not  be  transmitted.  This  results  in  a  code  of  length  (n  -  s)  and 
dimension  (k  -  s).  For  example,  to  shorten  the  (16,11)  code  of  (2-8),  we  can  set  the  first  four 
information  positions  to  zero,  resulting  in  the  (12,7)  shortened  Hamming  code  of  (2-9). 


100000010011 

010000010101 

001000010110 

000100011001 

000010011010 

000001011100 

Looooooiiiiu. 


(2-9) 


D.  PERFORMANCE  OF  (12,7)  SHOR1ENED  HAMMING  CODE  ON  BSCXp) 

It  is  useful  to  demonstrate  the  performance  of  this  code  at  this  point;  the  results  will  be  needed 
later,  as  well.  For  this  note,  the  binary  symmetric  channel  will  be  assumed.  Later  analyses  will  deal 
with  errors  which  occur  in  bursts. 

According  to  the  TACFIRE  da talink. protocol  [TACI^J,  the  occurrence  of  two  bit  errors  in  one 
character,  which  causes  an  error  detection  condition,  j  events  acknowledgement  of  receipt  of  the 
message;  therefore,  the  source  perceives  failure  of  message  receipt  and  retransmits  the  message.  The 
probability  of  such  an  event  is 


P*-1 


p3  (1-p)13**3 


(2-10) 


Table  2.1  shows  the  detected  character  error  probability  (and,  henre,  the  probability  of  a  non- 
oeknnwicdgcd  message)  as  a  function  of  the  channel  bit  error  probability  for  values  of  the  latter  from 
0.003  to  0.10. 


P _ Pc, 


0.10 

0.2301 

0.08 

0.1835 

0.07 

0.1565 

0.06 

0.1279 

0.0S 

0.09879 

0.04 

0.07206 

0.03 

0.04380 

0.02 

0.02157  * 

0.01 

5.968  x  lO*3 

0.008 

3.898  x  10*3 

0.006 

2.237  x  10~3 

0.005 

1.569  x  UT3 

0.004 

1.015  x  10“* 

0.003 

5.764  x  HT* 

Table  2.1.  Probability  of  detected  error  patterns  on  the  BSC 
These  data  are  plotted  in  Figure  3.1. 


ITT.  ERASURE  CHANNELS  AND  REED-SCLOMON  CODES 


A.  INTRODUCTION 

An  iteration  of  encoding  and  decoding  can  be  added  to  the  scheme  so  far  described.  Essentially, 
the  output  of  the  original  encoder  can  be  further  encoded  according  to  the  roles  for  another  suitably 
chosen  error  correcting  code.  At  the  channel  output,  the  original  code  can  first  be  decoded  as  before 
and  the  result  submitted  to  a  second  decoder  for  further  processing  [COOP78J.  It  is  useful  to  postulate 
a  different  kind  of  channel  when  introducing  the  additional  coding. 

B.  THE  ERASURE  CHANNEL 

In  a  received  message,  a  character  position  where  a  detectable  but  unconectable  error  pattern  has 
occurred  in  the  channel  can  be  considered  as  an  erasure,  Le.,  a  location  where  die  decoder  knows  that 
an  error  pattern  has  occurred  which  it  cannot  correct  Linear  block  codes  can  handle  ensures  more 
handily  than  they  can  handle  errors  whose  positions  axe  unknown  Bar  example,  a  code  with  minimum 
distance  dean  correct  (fill  in)  e  erasures  in  a  received  word  where 


d  >e  +  l 

(3-la) 

d>2t+l 

(3-lb) 

where  t  —  the  number  of  errors  correctable  by  the  same  code. 

We  now  take  a  modified  viewpoint  and  consider  a  noisy  channel  transmitting  characters  (binary 
m -tuples)  rather  than  individual  bits.  [GORE73J.  Characters  either  are  received  correctly  from  this 
channel  or  they  are  erased.  The  probability  of  a  character  erasure  is  the  probability  of  any  double  weight 
error  pattern.  Bor  the  (12,7)  shortened  Hamming  code  discussed  in  Section  II,  this  is  given  by  (2-10). 
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S:>  ire  channel  tnier  consideration  accepts  cl.niacicn.  represented  by  binary  m-tuplcs  aril  prc.ents 
to  the  destination  i  era.  c ter  erasure  with  probability  given  by  (2-10).  In  what  follows,  an  error 
detccung  and  corrccu-^  scheme  to  make  this  channel  quite  reliable  isdc'cribcd. 

C.  EXTENSION  FIELDS  AND  THE  BINARY  SYMMETRIC  CHANNEL 

If  binary  symbols  arc  manipulated  m  at  a  time,  modern  algebra  permits  all  the  ordinary  arithmetic 
operations  (addition,  multiplication,  inverses,  and  identities)  customarily  performed  upon  real  numbers 
[BERL78].  We  say  that  such  a  set  of  elements  and  operations  is  a  Galois  field  of  2m  elements,  GF(2m). 
(When  m  “  1,  we  have  the  familiar  binary  field)  In  GK2™  ),  we  can  cons,  met  linear  block  codes  as 
we  did  in  the  finite  cc'c:  to  every  block  of  k  information  symbols  from  GF(2m  )  append  (n  -  k)  parity 
cheek  symbols  as  linear  combinations  of  the  information.  Note  that  this  arithmetic  is  performed  in 
GF(2ra  ). 

An  interesting  class  of  codes  for  these  fields  is  that  of  the  Reed-Solomon  (RS)  codes,  which  have 
the  largest  minimum  distance  possible  for  a  given  length  and  dimension  (n,k)  (BERL68J.  A  RS  code  is 
a  linear  block  code  with  symbols  from  GF(2m),  length  n  —  2ra  —  1,  and  minimum  distance 
d  —  n  —  k  + 1  [REED60].  For  a  suitable  choice  of  m,  we  will  select  a  RS  code  for  use  on  the  erasure 
channel  previously  described 

D.  PERFORMANCE  OF  THE  COMBINED  CODING  SCHEME 

Before  actually  computing  the  performance  of  this  coding  scheme,  its  mappings  will  be  carefully 
presented  A  specific  example  will  be  considered 

The  character  set  is  a  49-element  subset  of  full  7-bit  ASCII.  Each  character  is  formed  according 
to  the  standard  7-bit  patterns.  However,  since  49  <  64,  only  6  bits  are  actually  needed  in  order  to  have 
a  unique  binary  pattern  for  each  symbol  to  be  transraitted-a  fact  which  we  shall  now  use. 

As  7-bit  ASG7  characters  are  produced,  they  arc  encoded  by  the  (12,7)  Hamming  encoder  as  at 
present.  Simultaneously,  however,  these  7  bit  characters  are  mapped  into  6-bit  patterns.  When  48  of 
these  have  been  bulfared,  15  parity  checks  will  be  computed  on  them  according  to  the  generator  matrix 
of  a  (63,48)  Reed-Solomon  code  over  GF(26  ).  These  parity  check  symbols  are,  of  course,  binary  6- 
tuples.  Tnese  6-tup'.es  will  be  mapped  back  into  binary  7-tuples  according  to  the  inverse  of  the  7  to  6 
mapping.  (A  rule  or  a  table  can  be  used,,  so  long  as  the  transformation  is  reversible.)  These  15  7-bit 
characters  arc  now  encoded  by  the  (12,7)  Hamming  encoder  and  are  concatenated  with  the  48  message 
characters  previously  encoded 


Decoding  is  accomplished  in  two  stages.  Received  information  is  processed  first  by  the  rtryortrr 
for  the  (12,7)  Hamming  code.  Any  double  weight  error  pattern  will  force  this  to  output  an 

"erasure"  condition  Cor  reedy  decoded  characters  are  presented  as  binary  7-mples.  These  are 
converted  to  6-niples  using  the  same  map  as  at  the  encoder  and  are  presented  to  the  for  the 

(63,48)  RScode. 

Since  eharencr  erasures  are  easily  sensed  by  the  RS  decoder,  it  should  ran  try  to  decode  when 
more  than  15  erasures  have  occurred.  In  the  context  of  this  note,  the  received  message  should  r»t  be 
acknowledged.  Tn:  probability  that  a  received  message  is  not  acknowledged  is,  therefore, 

(6j3)  Pee  (l-Pce>°“J  (3-2) 

>-16 

where  P^.  is  the  probability  of  a  character  erasure  at  the  output  of  the  Hamming  decoder.  Values  of  P* 
for  the  BSC  are  obtained  from  Table  2.1.  and  the  final  results  are- shown  in  Table  3.1  and.  plotted  in 
Figure  3.1. 


82 


I 


K 


c 


fv 


.  .'v  i.w.*  A.ii.J*_». ic-a. ^.vucr.jLw.^3 *  jn^koi'rv «i.«  —  v  4* ^ ^  - «r  _«r^  •  _  — • 


p 

P*. 

0.10 

O.IiOl 

0.372 

o.os 

0.1835 

0.103 

0.07 

0.1565 

0.0307 

0.06 

0.1279 

4.94  x  1CT3 

0.05 

0.09879 

3.21  x  It r4' 

0.04 

0.07206 

5.25  x  10-6 

0.03 

0.04380 

9.36  x  10-9 

0.02 

0.02157 

3.09  x  10"13 

0.01 

5.968  x  10-3 

7.30  x  10"22 

0.008 

3.968  x  10"3 

8.72  x  1CT25 

0.006 

2.237  x  i0~3 

1.30  x  10-2* 

0.005 

1.569  x  10’3 

4.61  x  10"31 

0.004 

1.01.5  x  10~3 

4.45  x  HT* 

0.003 

5.764  x  1£T* 

5.29  x  10"3* 

Table  3-1  Probability  of  decoding  failure  for  RS  codes. 


IV.  CONCLUSIONS 


A.  DISCUSSION  OF  RESULTS 

Improvement  of  the  message  rejection  rate  by  an  order  of  magnitude  has  been  demonstrated. 
The  original  coding  technique  employed  only  a  shortened  Hamming  code;  when  used  on  channels  with 
coherent  f  equency  shift  keying  (F5K)  (a  common  method  of  impressing  digital  infoimation  onto  FM 
radio  signals)  with  values  of  signal  to  noise  ratio  of  approximately  4  to  8  dB,  it  produced  message 
rejection  rates  ranging  from  6  x  10-4  to  0.1.  With  the  concatenation  of  RS  codes;  these  rates  dropped 
to  a  range  of  5  x  10-3*  to  4  x  10*4 . 

The  penalty  to  be  paid  for  this  improvement  is  twofold.  First,,  messages,  have  been  lengthened 
from  48  to  63  characters,  an  increase  of  31%  with  no  corresponding  increase  in  the  amount  of 
information  transmitted.  Second,  an  additional  stage  of  encoding  and,  more  significantly,  of  decoding 
must  be  added.  While  many  efficient  decoding  algorithms  for  RS  codes  are  known,  e.g.  [RLAH79],  the 
evaluation  of  the  added  complexity  must  be  the  subject  of  another  report. 


B.  FURTHER  WORK 

Undetectable  error  patterns  are  more  insidious  than  those  considered  in  this  note.  For  example, 
38  error  patterns  of  weight  four  are  codewords.  Since  the  sura  of  two  codewords  is  a  codeword,  the 
received  vector  will  be  one  also.  In  such  cases,  no  error  condition  can  be  detected.  This  behavior  will 
be  examined  in  a  forthcoming  report 

In  addition  to  determining  the  impact  of  adding  RS  decoders  to  existing  message  processing 
systems,  other  factors  must  be  studied  before  this  effort  is  complete: 

Ibr  this  analysis,  the  binary  symmetric  channel  with  values  of  p  from  0.003  to  0.1  was  used.  As 
asserted,  the  RSG  is  a  valid  model  for  certain  quiet  channels  which  are  limited  by  the  noise  generated  in 
the  rf  amplifier  of  the  receiver.  Conditions  under  which  such  a  model  is  valid  must  be  determined. 
Further,  more  realistic  noise  and  interference  models  (e.g.  noise  bursts)  must  be  considered,  and 
coding  techniques  such  as  the  one  studied  here  must  be  evaluated  against  them. 
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Another  consideration  is  that  channel  models  such  as  ihe  BSC  assume  that  the  world  is  discrete 
v,ticn,  in  feet,  it  is  not.  Such  channel  models  arc  realized  in  practice  by  examining  the  recciwd 
wa\cforrD  (signal  +  reuse)  and  making  a  statistical  decision  as  to  whether  a  binary  0  or  a  1  was 
received.  In  the  process,  information  about  the  reliability  of  that  decision  is  discarded  To  use  that 
information  in  order  to  improve  character  and  message  reception  reliability,  "soft  decision*  detection 
and  decoding  techniques  IFARR79J  arc.  under  investigatioa  Significant  improvements  in  message 
communication  have  been  claimed  for  these  methods,  and  they  should  be  investigated. 

Finally,  the  Hamming  ard  RS  codes  were  chosen  tor  this  pan  of  the  investigation  because  of  their 
popularity  among  coding  theorists  and  communication  system  designers.  The  technique  studied  above 
is  related  to  "concatenated  codc^  [FORN66]  which  can  be  constructed  from  a  variety  of  sets  of 
constituent  codes  ICOOP78].  Research  is  needed  to  select,  for  this  application,  codes  which  are 
optimum  in  terms  of  performance  and  decoding  complexity. 
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ABSTRACT 

The  objective  of  this  paper  is  to  illustrate  that  symbol  manipulation  systems  can 
readily  handle  many  of  the  typical  symbolic  calculations  arising  in  the  formulation  of 
problems  in  kinematics  and  mechanical  systems. 

The  paper  consists  of  two  parts.  First,  we  discuss  the  use  of  MACSYMA  in  con¬ 
nection  with  the  algebraic  manipulations  involved  in  transferring  a  body  from  one  po¬ 
sition  to  another  in  space,  with  particular  reference  to  Rodrigues  and  Euler  parameters 
and  successive  rotations,  and  an  example  involving  quaternions.  Second,  we  indicate 
how  MACSYMA  can  be  used  to  set  up  dyramical  equations  for  the  Stanford  manipu¬ 
lator  arm,  and  a  spacecraft  problem. 

INTRODUCTION 

Kinematics  is  a  basic  tool  for  the  analysis  of  mechanisms  and  mechanical  systems. 
Until  recently,  the  most  common  approach  has  been  to  use  vectors  and  Euler  angles. 
More  recently,  other  approaches  have  been  gaining  in  popularity  because  of  comput¬ 
ers.  We  illustrate  by  several  examples  that  these  approaches  are  particularly  amenable 
to  symbolic  manipulation.  The  immediate  objective  is  limited,  namely  to  indicate  that 
several  methods  of  representing  rotations  including  Rodrigues  and  Euler  parameters, 
and  quaternions  can  be  handled  by  MACSYMA  by  a  unified  approach  that  would 
seem  to  have  some  elements  of  novelty.  But  also  it  should  be  clear  that  our  examples 
suggest  a  different  approach  to  dynamical  problems  such  as  those  considered  by 
Branets  and  Shmyglevskiy  [3]  using  quaternions  and  Dimentberg  [4]  using  the  screw 
calculus.  The  nearest  connected  account  of  the  type  of  approach  we  have  in  mind  is 
the  mss.  [12]  by  Nikravesh  et  al.,  but  a  systematic  use  of  computer  symbolic  manipu¬ 
lation  would  certainly  affect  the  detailed  treatment.  This  is  the  first  part  of  the  paper. 

It  is  clear  that  the  complexity  of  mechanical  systems  is  increasing  to  the  point 
where  symbol  manipulation  must  play  an  important  part  in  their  formulation  and  solu¬ 
tion.  We  illustrate  by  two  dynamical  examples,  one  involving  a  robot  arm,  the  other  a 
spacecraft  problem.  The  main  reason  for  choosing  these  particular  examples  is  th««t 

This  paper  will  also  be  published  in  the  Proceedings  i.f  the  Third  MACSYMA  Users’  Conference 

1984,  supported  by  DOO. 
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the  equation!)  have  been  formulated  and  published  in  quite  a  detailed  form  already. 
By  comparing  our  treatment  with  those  already  published,  the  reader  will  be  able  to 
make  a  judgment  for  himself  concerning  the  usefulness  of  MACSYM  A,  and  also  how 
thinking  in  terms  of  symbol  manipulation  does  change  one's  approach  to  the  formula¬ 
tion  of  the  equations.  This  is  the  second  part  of  the  paper. 

We  give  the  MACSYMA  programs  in  detail  in  Appendices  for  two  reasons.  Ex¬ 
perienced  MAC>  MA  users  may  be  able  to  suggest  improvements.  Readers  familiar 
with  oth-.t  symbol  manipulation  systems  may  care  to  v.’rite  programs  for  the  same 
problems,  and  compare  their  programs  with  those  in  the  Appendices.  For  the  benefit 
of  new  and  old  users  we  encourage  authors  to  publish  their  programs  in  detail,  as 
done  here. 

KINEMATICS  EXAMPLES 

*.  The  Representation  of  Rotation  by  Orthogonal  Matrices 

We  renvnei  the  reader  of  some  standard  results.  W'e  work  in  terms  of  matrices 
.>  is  can  be  cc  .  .  *rted  into  vector  interpretations  as  appropriate)  using  upper  case  for 
matrices. 


A  rota,  ji-  of  a  body  with  a  fixed  point  by  an  angle  tf>  around  an  axis  defined  by 
the  unit  column  matrix  n  =  [n j,n2.n3]T  transfers  a  point  r ««  [x,y,z]T  into  a  point  r'  * 
[x',y',z']T  by  (cf.  Bottema  and  Roth  111.  p.  59)  r'  —  Ar  where  (see  Figure  1) 

A  =  [cos^I  +  (1  -  cos#)nnT  +  sin^N]  ,  (1) 


N  = 


0  -n3  n2 
n3  0  — ni 


— n2  nj  0 


(2) 


(Note  that  Nr  corresponds  to  the  vector  n  x  r,  and  I  is  the  identity  matrix). 


Figure  I.  Rotation  of  s  body  with  a  fixed  point. 


The  matrix  A  is  orthogonal.  We  discuss  three  different  ways  of  proving  this  using 
MACSYMA: 

a)  The  simplest  and  most  direct  way  is  to  express  (1)  in  component  form  and 
simply  check  by  brute  force  that  ATA  *  I,  using  TELLS1MP  to  impose  side- 
conditions. 
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b)  Alternatively  we  could  use  MACSYMA  interactively  as  follows.  It  is  easily 
checked  that 

nTn  =  1,  Nt=-N,  nTN  =  0,  Nn=0,  N2=  nnT-I  (3) 

We  use  MACSYMA  to  form  ATA  ,  which  will  give  nine  terms  involving  nTN, 
nnTnnT,  N2  etc.,  and  we  use  SUBST  to  simplify  and  finally  derive  ATA  =  I. 

c)  We  can  use  TELLSIMP  to  build  the  rules  (3)  into  MACSYMA.  Then  a 
MACSYMA  program  can  be  written  to  produce  the  result  I  for  ATA. 

Method  a)  is  clearly  simplest.  Method  c)  is  surprisingly  tricky  in  MACSYMA  be¬ 
cause  in  addition  to  (3)  we  have  to  distinguish  between  scalars  and  matrices,  and  set 
proper  switches.  For  verifying  that  AAT  =  i,  the  simplest  method  is  to  use  a)  not  c), 
but  for  more  complicated  prc.  blems,  method  a)  soon  produces  algebraic  expressions  of 
horrendous  complexity.  As  problem  size  increases,  method  c)  will  become  preferable. 
In  this  paper,  we  have  used  the  component  form  but  further  developments  may  re¬ 
quire  the  more  abstract  approach. 

2.  Rodrigues  Parameters 

We  introduce  these  by  stating  the  result  that  any  3x3  orthogonal  matrix  A  can  be 
expressed  in  the  following  product  form  by  the  Cayley-Klein  decomposi  ion  which 
says  that  there  exists  a  skew-symmetric  3x3  matrix  B  such  that  (cf.  Bottema  and 
Roth  [11,  p.  10): 

A  =*  (I  —  B)~l  (I  +  B)  (4) 

This  tells  us  immediately  that  B  =  (A  -  I)(A  +  I)-1.  MACSYMA  gives  us  directly 
(Appendix  I): 

bj  —  nj  tan  i  =  1,2,3  (5) 

The  bj  0  =  1 ,2,3)  are  the  Rodrigues  parameters. 

We  first  express  A  in  terms  of  the  Rodrigues  parameters.  We  find  (Appendix  II) 
cf.  Bottema  and  Roth  [1]  p.  148: 

1  +  bj2  —  b2  --  b3  2(bjb2  b3)  2(bjb3  +  b2) 

2(b2b1  +  b3)  1  -  b?+  b22-  b32  2(b2b3-b,)  (6) 

2 (b3b j  -  b2)  2(b3b2  +  b2)  1  -  b?  -  b22  +  b32 

i 

where  A  =  1  +  b22  +  b2  +  b32.  Using  the  notation  A  =  [a^,  it  is  clear  from  this  result 
that: 

bj  —  (a32  —  a23)/d 

b2  =  (ai3  —  a3i)/d  (7) 

b3  =  (a2j  —  aJ2)/d 

with  d  =*  1  +  a  |  j  +  a22  +  a33.  Having  established  the  necessary  background,  we 


derive  typical  basic  results  by  means  of  MACSYMA.  The  reader  should  compare  our 
derivation  with  those  of,  for  example,  Bottema  and  Roth  til,  Gibbs  [5],  and 
Dimentberg  [4]. 

Consider  the  result  of  first  rotating  a  body  round  an  axis  n  by  angle  $,  then 
around  a  second  axis  n'  by  an  angle  4>'.  Euler's  theorem  tells  us  that  the  result  is 
equivalent  to  a  rotation  by  s6me  angle  <f>"  round  some  axis  u".  In  matrices,  if  the  ma¬ 
trices  corresponding  to  these  three  rotations  are  A,  A',  A"  and  we  start  with  a  point  r, 
this  is  first  transformed  into  r'  =  Ar,  and  then  r'  is  transformed  into  r"  =  AT'.  We 
also  have  r"  -  A"r  so  that 

A"  -  A'A 

The  Rodrigues  parameters  corresponding  to  n",  <f>"  are  given  by  (7)  where  a^  are  the 
elements  of  A".  But  these  are  given  in  terms  of  the  first  two  rotations  by  the  the 
corresponding  elemnts  of  A'A  These  matrix  relations  are  carried  out  by  MACSYMA 
in  Appendix  III,  giving  the  result: 

b"  =  b±bizJW  (8) 

1  -  bTb' 

where  B  is  related  to  b  as  N  was  to  n  in  (2). 

Note  that  this  is  a  straightforward  derivation  that  would  be  laborious  to  carry  out 
by  hand,  a: :  compared  with  derivations  carried  out  in  the  literature  designed  for  the 
ease  of  hand  computation. 

3.  Euler  Parameters 

Instead  of  using  Rodrigues  parameter  bj,  it  is  often  convenient  to  use  Euler  param¬ 
eters  Cj  related  to  bj  by  (Bottema  and  Roth  [i]  p.  150) 

bi “  c/co  >  co  +  ci  +  c2  +  c3  «  1  (9) 

The  relation  (5)  then  gives 


Co  +  Cj2 -  c2  -  c2 
2(CqC3  +  C2Cj) 
2(-CqC2  +  C3Cj) 


2(-CqC3  +  C,C2)  2(CqC2  +  CjC3) 

Zq  -  c2  +  c2  -  C3  2(-C0Ci  +  c2c3) 
2(CoC!  +  c3c2)  c02  -  Cj2  -  c22  +  c32 


Although  it  would  seem  that  the  Euler  parameters  are  straightforward  homogene¬ 
ous  forms  of  the  Rodrigues  parameters,  it  turns  out  that  some  relations  are  expressed 
much  more  simply  in  terms  of  the  Euler  parameters. 

One  example  is  the  Euler  parameter  analog  of  (8)  for  two  successive  rotations.  To 
derive  this,  substitute  b  =  c/c0  ,  b'  =  c'/c0  in  (8)  which  gives: 

«,) 

CtC3  -  C*C 

When  this  is  written  out  in  detail  we  find  that  by  introducing 
c0"  -  c0'c0  -  c,'Cj  -  c2'c2  --  c3'c3  (12) 


pan 


l:i 


c0  *  C0C0-  C,Cj  -  C2C2-  C3C3 

Cl"  —  C,'c0  +  Cq'Ci  -  c3'c2  +  c2'c3  (12) 

C2"  =  C2'c0  +  C3'c,  +  C0'c2  -  C!'c3 

C3"  “  C3'Co  "  c2'ci  +  c1/c2  +  c0'c3 
equation  (11)  can  be  written' in  the  simple  form 

b"-c7c0"  (13) 

In  Appendix  IV  we  ch  ^  by  MACSYMA  that  if  Cq  +  cTc  =  1,  (cq')2  +  (c')V  =  1, 
then  (co")2  +  (c")Tc"  =  which  is  a  well-known  result  due  to  Euler.  This  result  and 
(12)  mean  that  c0",  Cj",  c2",  c3"  are  the  Euler  parameters  corresponding  to  the  total 
rotation. 

In  the  literature,  the  result  (12)  is  often  derived  via  quaternions  (e.g.  Bottema  and 
Roth  [1],  p.  518,520).  It  is  of  some  interest  to  express  this  approach  in  the  present 
context  of  Euler  parameters  and  matrices  which  can  be  done  without  mentioning 
quaternions  explicitly.  Introduce  y  and  T  defined  as  follows: 


co  ”ci  — c2 
C1  c0  “c3 
C2  c3  CO 
C3  “C2  C, 


CO  “Cl 
C,  C0 


If  y',r  are  the  corresponding  matrices  with  c'  in  place  of  c,  and  similarly  for  y",r", 
we  define  the  product  y'y  by  (compare  the  remark  following  (2)): 

y"  -  y'y  -  f  'y  (14a) 

which  says  exactly  the  same  as  (12).  We  first  note  that  if  we  define  y_l  — 
[c0, — c j , — c2, — c3] T  then  yy-1  *  y-,y  -  [1,0,0,0]T.  It  can  be  verified  (e.g.  by  the 
MACSYMA  program  in  Appendix  V)  that  introducing  p—  fro,rj,r2,r3]T,  r  — 
Iri,r2,r3]T  and  p',r'  correspondingly,  then  if  we  form  ypy-1,  and  denote  the  result  by 
p\  then 


Where  A  is  precisely  the  matrix  that  appeared  in  (10),  i.e.,  ypy-1  represents  a  ro¬ 
tation.  This  is  our  version  of  the  standard  quaternion  theorem  on  rotation,  derived  of 
course  from  a  completely  different  point  of  view  (cf.  Brand  [2J,  p.  417).  A  second  ro¬ 
tation  would  give  p"—  y'p'(y')-1,  and  combining  the  rotations  leads  to  p"  — 
y'yp  (y')“1y“1,  i.e.,  if  y"  represents  the  combined  rotation  then  y"~  y'y,  which  is 
identical  with  (12). 

Still  another  way  of  obtaining  (12)  is  suggested  by  the  discussion  of  Cayley-Klein 
parameters  in  Bottema  and  Roth  ({1]  p.  529),  namely  that  a  result  corresponding  to 
equation  (9.8)  in  that  reference  should  hold  for  Euler  parameters.  We  introduce  the 
notations: 

V  -  c0I  +  S 
V_,»  CqI  —  S 


-  ‘Air.  l'J\  TEttCX  L.\  S3CC 


AT  JlV-i 


•’  -•KJU  SU.VQ  ■lAi  .*T  J* ruwv KT^.  V 


0  -Ci  — c2  — c3 

y  z  0  — x 

cj  0  -c3  c2 

z  — y  x  0 

c2  c2  0  -ci 

,  q  - 

0  x  y  z 

C3  -C2  c,  0 

-x  0  z  -y 

Y  7  0  -X 

Z  -Y  X  0 
0  X  Y  Z 
-X  0  Z  -Y 


The  MACSYMA  program  in  Appendix  VI  does  the  following.  We  form  VqV-1  and 
equate  this  to  Q.  This  gives  16  equations.  However,  it  is  easily  checked  by  MACSY¬ 
MA  that,  in  fact,  there  are  only  three  independent  relations  involving  x,y,z  and  X,Y,Z 
which  can  be  written  in  the  form 

Aq  =  Q 

where  A  is  exactly  the  A  given  in  (10).  The  implication  of  this,  in  connection  with 
repeated  rotations,  is  that  if  q  corresponds  to  r  and  Q  to  r'  defined  in  the  second  para¬ 
graph  of  Section  1  and.  the  corresponding  A  is  denoted  by  V,  then 

VrV1  -  r' 


'  and  the  rotation  from  the  initial  po- 
Eliminating  r'  we  have  V"r(V")~  — 

(Mb) 


Similarly,  the  second  rotation  gives  VYY'”1  = 
sition  to  the  final  position  gives  V"rV"_1  =  r" 

V'VrV'V'-1  so  that  finally 

V"  =  V'V 

and  this  is  precisely  equation  (12)  cf.  (14a). 

4.  An  Example  Involving  Dual  Quaternions 

The  discussion  in  the  last  two  sections  was  concerned  with  the  rotation  of  a  body 
with  a  fixed  point  and  involved  only  three  independent  parameters.  The  general  mo¬ 
tion  of  a  body  involves  displacement,  as  well  as  rotation,  and  requires  six  independent 
parameters.  Rather  than  extending  the  methods  of  the  last  two  sections,  we  illustrate 
how  MACSYMA  deals  with  a  rather  different  approach  to  kinematics,  namely  via 
quaternions,  by  considering  a  calculation  in  a  classic  paper  by  Yang  and  Freudenslein 
(114],  1964)  dealing  with  a  spatial  four-bar  mechanism. 

In  Figure  2,  MA  and  NB  are  two  nonparallel  and  nonintersecting  lines.  MN  is  the 
common  perpendicular.  Let  a,b  denote  unU  vectors  in  the  direction  of  MA,  NB 
respectively,  and  let  ra,rb  denote  the  vectors  OM,  ON.  We  introduce  the  quaternions 

a-a  +  €(raxa)  ,  b*b  +  €<rbxb) 

where  €  is  a  symbol  with  the  property  that  c2  =  0.  Note  that  this  implies,  for  exam¬ 
ple,  that  if  9  —  9  +  es  then 
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sin0  —  sin0  +  cscos0  ,  cos0  —  cos0  —  essin0  (15) 

As  discussed  by  Yang  et  al.  [14],  the  relative  sftift  between  a  and  b  can  be  expressed 
as 

b-Qa  ,  a-bQ 

where  Q  is  a  duai  quaternion  (see  [141,  (22,  23)).  Successive  application  of  formulae 
of  this  type  gives  rise  to  a  loop  ciosure  equation  for  the  mechanism  of  the  form: 

A(0j)sin04  +  B(0j)cos04  -  C(0j)  (16) 

where 


A  A 

A(0-,)  =  sinai2sina34sin0i 

B(0j)  =  -sina34  (sina41cosa|2  +  cosa4isina12cos0i) 

C(0i)  =  cosa23  —  cosa34  (cosa4i  cosa-12  —  sina4isinai2cos0i) 


Here 


«12“  a12  +  eat2  ,  0i-0l+€Sn 

®23  33  a23  4*  €823  ,  02  *  02  +  €S2 
«34  "  «34  +  €a34  »  ^3  “  03  +  eS3 

A  A 

a41  *  “41  +  «a41  1  04  **  04  +  €S4 

It  is  then  dear  that  (15)  can  be  reduced  to  the  form 
P  +  eQ  =  R  +  eS 

where  P,  Q,  R,  and  S  are  independent  of  c.  It  is  required  to  find  the  explicit  form  of 
P,  Q,  R,  and  S.  To  calculate  this  by  hand  is  extremely  laborious,  but  straightforward 
in  MACSYMA.  The  program  is  given  in  Appendix  VII. 


M 
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Figure  2.  Relative  position  of  two  line  vectors. 


TWO  EXAMPLES  IN  DYNAMICS 


5.  Equations  of  Motion  for  the  Stanford  Manipulator  Arm 

There  are  a  number  of  ways  to  set  up  dynamical  equations  for  robot  manipulator 
arms  (see  Paul  1 13.1) .  Kane-Levinson  19]  have  given  an  example  of  setting  up  dynam¬ 
ical  equations  for  the  Stanford  manipulator.  Our  objective  is  to  reproduce  these  equa¬ 
tions  from  an  algorithmic  point  of  view,  without  having  to  do  by  hand  the  kind  of  ex¬ 
tensive  manipulation  given  in  that  paper.  The  method  can  help  us  to  set  up  similar 
sets  of  equations  for  any  manipulator  automatically,  thereby  reducing  the  labor.  We 
also  show  that  MACSYMA  can  simplify  the  Kane-Levinson  end-result,  reducing  the 
numbers  of  arithmetic  operations  required  to  obtain  numerical  results. 


We  consider  the  Stanford  manipulator  arm  (Paul  [13]),  a  six-element,  six-Jegree- 
of-freedom  manipulator.  A  schematic  representation  of  this  arm  is  given  in  Figure  3, 
from  Kane-Levinson  19],  where  more  details  can  be  found.  The  six  bodies  are  desig¬ 
nated  A,  ...,  F.  Body  A  can  be  rotated  about  a  vertical  axis  fixed  in  space.  A  supports 
B  which  can  be  rotated  about  a  horizontal  axis  fixed  relative  to  A.  The  figure  should 
now  be  self-explanatory,  the  joint  connecting  B  and  C  being  translational,  and  the 
remaining  joints  rotational. 


qt, ...,  q6  are  generalized  coordinates  characterizing  the  instantaneous  configuration 
of  the  arms,  the  first  five  being  rotational  and  q6  translational.  For  the  plane 
configuration  of  the  arms  as  drawn  in  Figure  3,  it  is  assumed  that  qb  •  •  •  q5  are 
zero. 

We  choose  coordinate  axes  as  follows.  ni?  n2,  n3  are  unit  vectors  fixed  in  space  as 
indicated  in  Figure  3,  n1?  n2  lying  in  the  plane  of  the  paper,  aj,  a2,  a3  are  unit  vectors 
fixed  in  the  arm  A  which  coincide  with  n5,  n2,  n3  when  the  arm  is  in  the  configuration 
of  Figure  3.  Similarly,  bj,  b3  are  unit  vectors  attached  to  the  arm  B  and  similarly 
for  C,  D,  E,  and  F. 

We  give  a  mathematical  description  of  an  algorithm  for  setting  up  the  dynamical 
equations.  This  is  essentially  the  algorithm  described  by  Kane-Levinson  [9],  but  orga¬ 
nized  in  a  somewhat  different  way  in  order  to  facilitate  implementation  on  MACSY¬ 
MA.  The  stages  and  details  of  the  MACSYMA  program  which  are  in  Appendix  VIII, 
parallel  the  mathematical  description  that  follows: 

Stage  1 :  Set  up  angular  velocities: 

Rotations  about  x,y,z  axes  can  be  described  by  orthogonal  matrices  of  simple  form 
as  discussed  in  detail  Ky  Paul  [13],  Chapter  1.  For  instance,  rotation  by  an  angle  0 
about  the  x-axis  involves  ([13],  p.  15) 

10  0 

Rotx  (0)  =  0  cos0  -sin0 
0  sin0  cos0 

Let  Rj,  ...  R5  denote  matrices  corresponding  to  rotations  0j,  ...,  05  about  axes, 
y>x,y,x,y  respectively  in  the  local  coordinates  fixed  relative  to  arms  A,  B,  D,  E,  k 
Let  qj,  ...,  q5  denote  angular  velocities  around  y,x,y,x,y  axes  respectively.  These  are 


vector  quantities  represented  by  matrices  that  we  denote  by  toj,  o)5.  For  instance, 
—  I0,q|,0]  etc.  Similarly,  fot  the  linear  velocity  q6. 


F* 
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Figure  3.  A  schematic  representation  of  Stanford  manipulator  arm. 

Next  introduce  cuA,  ...,  o>F,  the  angular  velocities  of  A,  ...,  F  in  our  Newtonian 
frame  of  reference,  but  with  components  expressed  in  the  local  coordinate  frame  of 
reference.  For  example: 

wD  -  [ui,u2,u3jl  means:  <uD  -  ujd,  +  ujdg  +  u3&  (17) 

■  / 

The  algorithm  for  computing  o>A, ...,  tcF  is  given  by: 

»  wjRj 


=  (a>A  +  ct)2)R2 


<l)^  —  ((i)C  +  6)3)  R3 
o»E  —  (a>D  +  6*4)114 
<yF  =  (a>E  +  0)5^5 

If  these  formulae  are  used  as  they  stand,  the  expression  for  wF  in  terms  of  cjj  will  be 
complicated.  The  complexity  can  be  reduced  using  a  method  due  to  Kane* 
Levinson  [91.  The  us  that  occur  in  (16)  can  be  expressed  in  terms  of  qj,  q2, 0(3  as  fol- 


u,  =  q!  sinq2  slnq3  +  q2  cosq3 

u2  =  q,  cosq2  +  q3 

u3  -  -qj  sinq2  cosq3  4-  q2  sinq! 

«i  -  qj  i  -  4,5,6 
Stage  2  Set  up  linear  velocities: 

In  stage  1,  the  angular  velocities  were  always  expressed  in  local  coordinates 
corresponding  to  the  arm  being  considered.  This  is  not  necessarily  the  case  for  the 
way  in  which  Kane-Levinson  [9]  formulate  the  linear  velocities  (see  paragraph 
proceeding  (28)  in  th°-  paper).  Because  we  wish  our  results  to  be  comparable  to  those 
in  [9],  we  state  the  formulae  we  use,  which  will  lead  to  results  that  are  the  same  as 
those  in  equations  (28-43)  in  19].  (Note  that  the  stars  in  the  following  refer  to  the 
velocities  of  the  centers  of  mass  of  the  corresponding  arms.) 


vB*  «  wA  x  Rb 
vc*  »  ojb  x  RC  +  q6 
vD*  -  wB  x  Rd  +  q6 

The  expressions  for  vE\  vF*  correspond  to  those  in  equation  (40)  and  (42)  in  the 
K?ne-Levinson  paper  193.  The  exact  form  we  use  can  be  found  from  the  expressions 
for  VE  and  VF  in  the  MACSYMA  program  given,  in  Appendix  VIII. 

The  remaining  stages  are  relatively  straightforward. 

Stage  3:  Find  the  partial  angular  velocities. 

Stage  4:  Find  the  partial  linear  velocities. 

These  are  explained  in  the  Kane-Levinson  paper  [91  and  the  MACSYMA  imple¬ 
mentation  in  Appendix  VIII  is  self-explanatory. 

Stage  5:  Find  the  angular  accelerations. 

Stage  6:  Find  the  linear  accelerations. 


These  are  obtained  by  simple  differentiation  cf  the  corresponding  angular  and 
linear  velocities  as  given  in  the  MACSYMA  program  in  Appendix  VIII. 

Stage  f.  Define  moments  of  inertia. 

We  next  have  to  consider  forces. 

Stage  8:  define  torques. 

Stage  9.  Set  up  generalized  forces. 

Stage  lOr.  Set  up  active  forces. 

Stage  11:  Set  up  Kane’s  equations. 

These  steps  are  straightforward;  the  MACSYMA  program  is  given  in 
Appendix  VIII. 

Finally,  Figure  4  gives  a  comparison  of  some  numerical  results  obtained  from 
MACSYMA  and  Kane-Levinson  19]. 

It  is  of  some  interest  to  compare  the  mathematical  equations  in  the  Kane-Levinson 
paper  with  the  corresponding  MACSYMA  expressions.  For  example,  consider: 

Kane-Levinson  19]  MACSYMA 

(13)  WA:  EXPAND(Wl.Rl) 

uA  S  WA  ,  q,a2  =  W1.R1 

(8)  Here  o»A,  ql5  a2  are  vectors; 

WA,  W1.R1  are  matrices 

(15) 


(16)  WB:  EXPAND  ((WA+W2).R2) 


One  point  here  is  that  because  Kane-Levinson  [9]  are  carrying  out  the  algebra  by 
hand,  it  is  convenient  for  them  to  introduce  intermediate  symbols  Zj,  Z2  *  *  *  going 
up  to  Z]96,  and  similarly,  36  X’s  and  31  W’s.  MACSYMA  has  no  difficulty  in  gen¬ 
erating  the  end  result  in  explicit  form.  These  end  results  are  no  more  complex  than 

95 


(underlined  quantities  are  vectc 


wA=s  dl£2 


di 


But 

_  UlS3  -  U3C3 
s2 


Introduce 

Z4-7-,  Zs- 

s2  c2 

Then  (13)  becomes: 
wA  **  (Z4u  j  +  Z5u3)a2 
Similarly, 

o)B  =*  Z^j  +  Zjob2  +  Zjjb3 

where 

Z10  “  ZgUj  +  Z7U3,  Z„  ”  ZgUj  +  Z9I 


Figure  5.  Two  rigid  bodies  with  a  common  axis  of  rotation. 


The  equations  are  given  in  complete  detail  in  Ref.  [Ill,  and  translated  into 
MaCSYMA  in  Appendix  IX.  In  the  example  in  the  last  section,  we  wrote  the 
MACSYMA  program  in  terms  of  matrices.  In  Appendix  IX,  !he  present  example  is 
written  in  terms  of  vectors,  by  writing  BLOCK  functions  to  perform  the  dot  and  cross 
products.  To  illustrate  the  comparison  of  the  vector  equation  with  the  corresponding 
MACSYMA  expressions: 

Equations  from  Ref.  [11]  MACSYMA 


I?  - 

cosq  b2  +  sinq  b3 

(1) 

R  [2]  :COS  (Q)  *B  [2]  —  SIN  (Q)  *B  [3] ; 

ns 

(0  3=8 

Uibi  +  u2  b2  +  u3  03 

WB:U[l]*B[ll  +  U[2)*B[2]+U[3]*B[3l; 

fl4  - 

q 

U[4]:DIFF(Q,T); 

aR  = 

-j-  (a>R)  +  a>B  x  <oR 
dt 

(7) 

ALPR:DIFF(WR,T)  +  CROSS  (WB,WR); 

We  discuss  only  one  other  correspondence.  Equation  (27)  in  Ref.  [Ill  is 
Fr  =  •  (F)b  +  ~  ■  (T)„  <i—I,  •••,  7) 

OflT 

which  becomes  in  MACSYMA 

FlRl : = DOT  (DIFF  (VBS,U  [R] )  ,FB)  +  DOT(DIFF(WB,U[R]),TB); 

i 

The  complete  set  of  equations  given  in  Ref.  [11]  is  generated  by  Appendix  IX. 
The  reader  should  compare  the  corresponding  FORM  AC  program  given  in  Levin¬ 
son  [111. 

CONCLUDING  REMARKS 

It  should  be  clear  from  the  examples  given  that  symbolic  manipulation  by  comput¬ 
er  can  carry  out  many  of  the  laborious  and  routine  calculations  involved  in  the  analy¬ 
sis  of  mechanical  systems.  But,  potentially  even  more  important,  is  the  influence  that 


symbolic  manipulation  is  likely  to  have  on  the  methods  used  to  formulate  problems. 
The  reader  should  compare,  for  example,  the  algorithmic  approach  we  have  adopted 
to  the  Stanford  manipulator  arm  problem  with  the  approach  in  [9].  As  another  exam¬ 
ple,  if  symbolic  manipulation  methods  are  used,  this  will  influence  whether  we  formu¬ 
late  problems  in  terms  of  Euler  angles,  Euler  parameters,  Rodrigues  parameters,  or 
quaternions,  etc.  In  addition,  one  can  visualize  the  production  of  standard  MACSY- 
MA  software  to  produce  equations  corresponding  to  those  of  Kane-Levinson  [9]  for 
any  given  combination  of  rotating  and  sliding  joints. 
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APPENDIX  I 


APPENDIX  IV 


/•PROVE  THE  IDENTITY  OF  EQUATION  5  V 
/♦  TRKJNOMETRIC  SIMPLIFICATION  V 
MATCIIDIX'LAKttA.TRUF.). 

TELLSlMPISINCAIT.I-CDSCAI^l: 

/•  DEFINE  CROSS  PRODUCT  MATRIX  OR  ALTERNATING  TENSOR  V 

ALT(N).-MATRIX(IO.-NI3,II,N|2,1||.|N!I.I|.0,-NII,!||. 

i-NI2.ll.NII.ll.0l); 

N:M  A'f  RIXCtN  ll.lN2j.lN3)): 

NN.ALT(N): 

I.IDENTU); 

AACOS(ALPIIA)*I+(I-COS;aLPIIA))*(N  .  TRANSPOSECNH-INNTINCALPHA): 
AAP.I+AA: 

/*  WORK  WITH  HALF  ANGLES  */ 

ALPHA  BTA*2: 

EVIAA); 

TftICiSXPANDM); 

AA»S 

/*  ADD  IDENTITY  MATRIX  AND  INVERT  V 

AAP:AA+IS 

IAAP;AAP"I-IIJ 

/'SUBTRACT  IDENTITY  MATRIX  AND  FORM  MATRIX  PRODUCT  AS  ANSWER*/ 
AAMAAIS 

ANSWER;AAM  .  lAAPi 

/•USE  IDENTITY  THAT  Nrj+N2‘2+N?*2-I  •/ 

NN3:I-NI~2-N2~2; 

ANSWER. RATSUB$T(NN3,N3'2.  ANSWER): 

ANSWER  .RATSIMP(%): 

APPENDIX  II 


/*  CAYLEY'S  DECOMPOSITION  OF  ORTHOGONAL  MATRIX 
A  *  ll-B)‘-l  (I +BI.WHERE  RI.B2.1I3  ARE  RODRIGUES  PARAMETERS’/ 
/•  DEFINE  CROSS  PRODUCT  OR  ALTERNATING  TENSOR  MATRIX*/ 
ALT(N):«MATRIX<K).-ND.1].N(2.1|I,|N|3.I].3.-NII.1]|. 
J-NI2.ll.Nlt.ll.0l): 

BMATRIX(IBIUB2I.|B3|): 

BBALTIHI. 

I.IDENTU); 

INR«.U-W)"-I: 

A.INSB.It-f  BB>; 

AfcWER.RATSIMPI*); 

/•SOLVE  ABOVE  FOR  Rl  B2  B3.FOLLOWIN  IS  A  CROSS  CHECK  7 
DEL:RATSIMP(I  +  A|I,I1  + AI2.2I  +  AI3.3)); 
BBI:RATSIMP(I/DEL*(A|3.2|-A|2.3|)); 

BB2.RATSIMPI  l/DEI. *(Al  1,31- A|3.  I|)); 
BB3RATSIMr<l/DEL*(Al2.ll-A(l.2l)); 

APPENOIX  III 


/TWO  SUCCESIVE  ROTATIONS  IN  TERMS  OF  RODRIGUES  PARAMETER’/ 
ALTIN)  -MATRlX(IO.-Nl3.II.Nl2.lll.(Nl3.l].O..N(l.1ll,(-N12,ll.Nll.ll.C;); 
B.MATRIX(|BI|.|B2|.JB3J); 

BBALT(B): 

UDENTU): 

iNBB.CI-BB)"-); 

AJNBB.CI+BBU 

A:RATSIMP(W): 

BP:MATRIXttPPII,lBP2),!BP3l): 

Bi?:ALT(BP): 

INBBP.vt-BBP)"-l; 

AP.1NBBP.il  A  BB?)S 
AP.RATSIMP(W): 

APIEAP.A; 

/•SOLVE  ABOVE  FOR  BrTI  BPP2  BPP3  V 

DEL:RATSIMPn+APP|l,li+APP|2.2l+APP(3,3l); 

•PPI:RATSIMP(1/DEL*(APPl3.2l-APPi2.3))); 

BFP2  RATSIMPH/DFL’IAPPII, 3I-APP!3.I1)): 
BPP3:RATSIMP(l/DEL*(APPI2,l|-APPil,2l)); 

/•THE  ABOVE  RESULTS  ARE  SAME  AS  EQUATION  (I  I)  V 


/’  DERIVE  EULER'  IDENTITY  SEE  ALSO  BRAND  REF.  121  P.404V 
S:MATRIX( 

!o.  rci.  re?,  col. 

ICCI.O.-CC3.CC2I. 

ICC2.CC3.0..CCII. 

ICC3.-CC2.CCI.Ol): 

SP.M/.TRIX( 

l0.-CPI.-CP2.-CP3i. 

ICPI.0.-CP3.CP2I. 

ICP2.CP3.0.-CPI1. 

(CP3.-CP2.CP1.01): 

l:IDUNT(4): 

VCC0*l+S: 

VP.CP0*l+SP; 

MATLV.VP. 

/•  NOW  TAKE  THE  FIRST  COLUMN  OF  THE  ABOVE  MATRIX  AND  SQUARE  IT*/ 
MAT2SUBMATRIXIW.2J.4): 

ANSWER*.*; 

ANSWERlFACTOR(ANSWER): 

/•  NOTE  ABOVE  IS  A  COMPLETE  SQUAF.E  V 

APPENDIX  V 


/•QUATERNION  MULTIPLICATION  EXAMPLE  V 
/•  AN ALOG  OF  CAYLEY-KLE1N  RESULT  V 
EiDENTM); 

/•  NOW  WE  DEFINE  AN  OPERATION  SS  ON  A  COLUMN  MATIX  BASED  ON  ANALOG 
OF  CAYLEY  KI.EIN  DECOMPOSniON  V 
SSiCC):”MATRIXl(CCll,ll,-CC!2.ll.-CCl).ll.-CCM.IIl.  ' 
ICCi2.II.rCll.ll.-CCl«.II.CC|3.lll. 

1CCI3.I1.CCI4.I), CCH.li.-CCl2.lll. 
tCC(4.l|.-CC13.ll.CCl2.II.CC!l.li|): 

/•  DEFINE  AN  INVERSE  OPERATION  V 

INV(CC):*l/(CC.CC)*MA7RIX(ICC|!,ll|,(-CC|2,lll.|-CCIi.!!l.l-CC|4.ll|): 

/•  NOW  THE  BRANDS'S  THEOREM  ON  QUATERNION  FORMULATED  IN  MATRIX  FORM  ‘ 
RHO:MATRIX(lROI.IRlMR2l.tR3l): 

UAM:MATRIX(|Q0l.|oi:.|Q}|.|03!>: 

/•NOW  DEFINE  QUATERNION  PRODUCT  •/ 

APRCD(R,Q):“SS(fi).Q; 

A:MATRIX(lA01.|AI|.|A21,|A3|): 

RATSIMP(APROD(INV(A).Al), 

ANSWER:RATSlMP(APROO(GAM.AFROD(RHO,lNV(OAM>))): 

EQIUINSWERII.il: 

EQ2:ANSWER  12,11: 

EQ3.ANSWERD.ll; 

EQ4:ANSWER14.I|; 

/•  NOW  GENERATE  COEFFICIENT  MATRIX  FOR  RIIO  V 

COEFMATRIXClEQI.EQ2.FQ3.EQ4l.lRO.RI.R3.R3ll: 

r  THE  ABOVE  IS  Same  AS  EXTENDED  EULER  PARAMETER  MATIUX  V 

APPENDIX  VI 


/•  THE  BASIC  DECOMPOSITION  FOR  EULER  PARAMETER  V 

/TEST  OUT  (C0*l+S)X<CCri-S)  V 

ll:IDENT<4); 

SS:MATRIXItO.-CCI,-CC2,-CC3l. 

ICCI.0.-CC3.CC2), 

[CC2.CC3.0.-CCII, 

1CC3.XX2.CCI.01): 

Q.MATRIXHYAO.-Xl. 

|7,-V,X.0|. 

(0.X.Y.ZI. 

t-X.0i.-Yl); 

EQI:EXPAND((OC(rn+SS).Q.(CC0*n-SS)); 

TIEQII2.3I; 

T2EQIH.11; 

Tl:EQI|4.3l: 

ANSWER  COEFMATRIX(tTI.T2.T3I.IX.Yil); 

/•ABOVE  IS  SAME  A  S  EQUATION  1 1  V 


APPENDIX  VII 


/• . ALGKBAA  FOR  QUATERNIONS  FROM  YANG'S  PAPER....*/ 

NNPREDIN):-ISIN>«2): 

MATCHDF.CLARK<NN.NN?AED); 

Till  SIMPAItrRICrNN.O). 

/•ABOVE  WILL  ELIMINATE  cP*‘2  TERMS  */ 

AU2IIAUJ+tP*AIJ: 

AL23II  Al.23-1  EP*A23; 

AL34ll:ALJ4  +  ;r*A34; 

AL4IIIAL4I  +LI-A4I: 

TIIIII:TIII+EP*S1I: 

TII2II.T!I2+EP*S2: 

TII3ll:Til3+EP*S3: 

TH4II:TII4  +  EP*S4: 

SALI2II:  EXPANO(TAYLOR(SIN(ALi2ll!.EP.O.I)): 

SAL23II  fcXPAND(TAYLOR<SIN(AL23ll).EP,O.I)): 

SAL34II:  EXPAND  (TAYLOR  (S'1N(AL34II).£P,0.1)); 
SAL4III:EXPAND(TAYLOR(SIN(AL4III).EP.O.I)>: 

STII III-  EXPA NDITA YLOR (SiN (Til  I ID.EP.O.I »; 

ST  11211:  EXPAND(TAYLOR(SIN(TII2II).EP.O.O); 

STH3II:  EXPAND(TAYLCR(5IN(TU3H),EP,9,I»; 

ST11411:  EXPAND(TAYLOR(SIN(TII4MI.EP.0.l)); 

CALI2II  EXPAND(TAYLOR(COS(ALI2II),EP.O.IH; 

CAL23II  EXPAND(TAYLO*(COS(AL23ll),EP.9.l»; 
CAL34ll:EXPANDlTAYLOR(COS(AL34i;).EP,O.I»: 

CAL41H  EXPAND(TAYLOR(COS(AL41H).EP.O'.I)); 

CTIIIII:  EXPAN  DITAYLORICOSITII  III), EP.O.I)); 

CTII2II:  EXPAND(TAYLOR(COJ(TII2II).EP.O.I)); 

CTII3II:  EXPAND(TAYLOR(COS(TII3II),EP.O.II); 

CTII4II:  EXP  ANDITAYLOR  (COS  (TIMID, EP.O.I)); 
AAT1IIII5AH2H*SAL34H*STUIH; 

8BTIIIII:-SAL34H*(SAL41II*CALI2H+CAL4IH,SALI2H,CT1!IH); 

CCTIIill.CAL23HCAU4H,(CAL4ll(*CALI2!MAL4lirS/.LI2H*CnilH); 

EQI:AATIIIII*STII4!I4-MTHIHTTH4H-CCTHIH; 

PRIMARY:EV(EQI,E?-0>; 

DUAL:RATCOEFFlEQI.EP>; 

A:RATCOEFF(PRIMARYSIN(TI!4ll; 

B  RATCOEFFIPRIMARY.COS(TH4»: 

CEXPnND(PRIMARY-A*SIN(Ti;4)-rCOS(TM4)): 

DUALl:DUAL-S4*(A*COS(TH4).B*SINiTH4)); 

ao  ratcoeffiuuali.sinitim)): 

M  R  ATCOEFI(DUALI.COS(TII4)); 
CCO.EXPAND(OUALI-AO*SIN(TH4)-D(PCQ5(TH4)); 

CCO.RATSIMP(CCO); 

APPENDIX  VIII 


/•DYNAMICAL  EQUATIONS  FOR  STANFORD  MANIPULATOR*/ 
MATCIIOECLARE'A.TRUE); 

UEPEMi)S<lQI.Q2,Q3,Q4,Q5.Q6|,T); 

t)LPFNI)S(U.I); 

/•TRKiNOMETRIC  SIMPLIFICATIONS  7 
TELLSIMP(S(N(A)‘2.l-COS(A)‘2); 

SISIN(QI); 

CCICOSIQI); 

S25IN(Q?>: 

CC2COS(Q2); 

S3-5INIQ3): 

CC3COSI03I; 

/•  EXPRESS  LOCAL  ANCJULAR  VCLOCITiES  IN  TERMS  OF  OENERAUZED  ONES*/ 
QDl:l/S2,(U|ll*S3-UI31*Ca); 

<)D2.UI1I*CC3+U13I,SJ-. 

Q03.UI2I  +  (UI3|*CC3-UH|*S3)*CC2/S2; 

QD4  U|4|. 

QD3  L1|SI; 

QD6UI6I: 

GRADEF(QI.T.QDI); 

GRA0EF(02.T,QD2); 

CRAUEF(Q3.T.Q03); 

GRADEF<Q4.T,QD4); 

GRADEFIQJ.T.QDJ); 

GRADEFIQ4.T.QD6); 

/•DEFINE  ROTATIONS  7 

ROTX(QI:-MATRIX(IU>.OI.tO.COS(Q>.-SIN(Q)U03IN(Q).COS(Q)l); 

ROTY(QI:-MATR;XdCt3(Q).0„SIN(QI|.Kl.l,OUSIN(0>.O.COStQ!|); 

ROTZ(Q):«MATRIX(KTS((J).^IN(Q).OI,|SIN(Q).COS(Q).OUO.O.II); 

WI:MATRIX(W,QOI,(i!); 


W2MATRIXUQD2.0.0I); 

W3:M  AYR1X  ( I0.QD3.0)); 

W4.MATSIX<K}D4.0.0|>; 

W5:MATRlX(|0.(}D5.a|); 

W«.MaTRIX((0.OI)6.O|): 

/•SET  UP  ROTATION  MATRICES  7 
RI.ROTY(QI): 

R2:ROTX(02): 

R3:ROTY(Q3); 

R4:RWTX(Q4I; 

RS.ROTYIQJ); 

/•STAGE  I.  SET  UP  ANGULAR  VELOCITIES  7 
WAEXPANDIWI  .  Rl); 

WB  EXPANDIWI .  Rl .  R2+ W2 .  R2); 

WC:W3: 

WD:EXPaND(WI  .  Rl .  R2 .  R3-FW2 .  R2 .  R3-PW3  .  R3>; 

WE  EXPANDIWI  .  Rl  .  R2  .  R3  .  R4  + W2  .  R2  .  R3  .  R4+ WJ  .  R3  .  R4-I  W4  .  R4): 
WF:EXPAND(WI.R1.R2.R3.R4.  R5+W2.R2.R3.R4.R3+W3.R3  R4.RJ+ W4.R4.R5+W5  R5); 

/*  SET  UP  BASE  VECTORS  AND  CROSS  PRODUCT  7 
AA:MATRIX(|AAI,AA2.AA3I); 

BB.MATRIX(!BBI.BB23B3|); 

CC:MATRIX(ICCI,CC2.CC)I); 

DD.MATCIX(!DDt.DD2.DD3)h 

EEMATXIX(|EEI.EE2.EE3I); 

FF.MATRIX(|FFI,FF2.FF3|); 

CROSS(A.B.BASE):“BLOCKIH.MATRiy.<lA(l.2l'Bll.3l-All.3l*B|l,2l. 

HAll.ll*SlUMH.3l*BII.I1).All.ll‘Btl.2l-All.2l’R(t.lli>>: 

/*  LENTHE  VECTORS  FOR  VELOCITIES  7 
V ECU  MATA IX(lLt.O.OI); 

VECQtMATRIXIf0.Q6.0l): 

VECLS.MATRIXdO.LS.Ol); 

VECL2:MATRIX(f0X2.0|); 

VECL6-MATRIXIIO.L6.0I): 

VECU:MATRIX(|0X3,0I): 

VECL4MATRIX00.L4.0l); 

/•STAGE  2.  SET  UP  UNEAR  VELOCITIES  7 
VAMATRIXGO.O.O!); 

RB:MATRIX(tLI.L4,01); 

VB.CROSSIWA.RB.AA): 

RC:VECQ6+VECLI .  R2: 

VCCROSSIWB.RC.CC); 

/•ADD  UNEAR  COMPONENT  7 
VC:’/C+W6; 

RI):VECLt .  R2+VEOQ6+VECL5; 

VD. CROSSIWB.RD.CO; 

/•ADD  UNEAR  COMPONENT  7 
VD:VD+W6; 

/•FOR' VF  START  WITH  VELOtT.  V  OF  C  V 

VE. EXPANDCVC .  R3 .  R*)+ CROSS (WE-W4.VECL2 .  A3  .  P.4.EE)+CAOSS(WE.VECL6.EE>: 
/•REPL  ,vE  16  BV 13  IN  ABOVE  FOR  VELOCITY  OF  F7 
VFJIATSUBST(L3.L6,%);  . 

/•STAGE  J.  SET  UP  PARTIAL  ANGULAR  VELOCITIES  7 
FOR  I  THRU  6  DO  LD3PLAYIWaR|I):RATCXM:F(WA.II|I|I); 

FOR  I  THRU  6  DO  WBR|t|:RAKUI-:FtWB.UIII>; 

FOR  I  THRU  6  DO  WCRUIXATCOEFtWC.UlIl): 

FOR  I  THRU  6  DO  WDftUlXATCOEFIWD.Ulll); 

FOR  I  THRU  6  DO  WERIUXATCOEFIWE.UII1). 

FOR  I  THRU  6  DO  W.FR|l1:RATCOEF(WF.Ulll); 

/•STAGE  4.  SET  UP  PARTIAL  LINEAR  VELOCITIES  7 
FOR  I  THRU  6  CO  VARfltXATCOEFiVA.UIII); 

FDR  I  THRO  6  DO  VBRm.RATCOEF<V8.U|ll): 

FDR  I T5 IRU  6  DO  VCRll|:RATOOEF<VC.UIIi>; 

FOR  I  THRU  6  DO  VDR|I|:RATC0EF(VD.U!M>: 

FOR  I  THRU  6  DO  V£R|t|:RATCOEFfVE.U!li): 

FOR  I  THRU  6  DO  VFRlll:RATCOEF(VF.Ulli>; 

/•STAGE  5.  FIND  THE  ANGULAR  ACCELERATIONS  7 
ALPHA  A:DIFF(WA.T)$ 

ALPllAB:DIFr(WB.T)$ 

ALPIIAC:DIFF(WC.T)* 

ALPIIADJHFFIWD.TTS 

ALPHAE.DIFF(WE,T)$ 

ALHIAF:D«F'.WF.T)S 

/•STAGE  6.  FIND  THE  LINEAR  ACCELERATION  •/ 

ACCAXHFFiVA.T); 

ACCRDIFF(VB.T)+CROSS(WA.Vi.AAIJ 

Acccmrx  vc.T) + cross'  wb.vc.b*)  j 
ACCD:CIFF(VD.T)  -FCROSS(WC.VD.CC)* 

ACCE:INFF(VE.T) +CROSSIWE.VE.EEI* 

ACCF.IHKFIVF.T) +CRnSS(Wt-.VF.rE)S 
/•STAGE  7.  MOMENTS  OF  INERTIA  7 
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IA:MATRIXI|IAI.1A2.IA3I>; 

IBMATKIXUlBt.l32.1BJl); 

It*  MATRIXIIICI.IC2.IC31); 

IP  MATRIXIllUI.IU2.IDJI): 

IE:MATRIX<llbl.lE2.IR3|>; 

IIF.MATRIXI1IIFI.IIF2.IIFJI): 

/•STAIIF.  8,9.10.  DEFINE  TORQUES.REACTIONS.  AND  OENFRAI.IZF.D  FORCES  FOR  A.B.C.D.E.P/ 
1  )S:-ALPHAATA-CROSS(WA.lA*WA,AA>» 

RAS:-MA‘ACCAt 

FOR  I  THRU  6  DO  LDISPLAY(KAS|I):WaR|I1  .  TAS+VARlIl .  RAS)S 

TBS'-ALPIIAB‘IBCROSSiWB.lB,WB.BB>S 

RBS'-MB'ACCBS 

FOR  I  THRU  6  DO  KBSttj:WBRll] .  TBS+VBRll! .  RBSS 

TCS:-ALPHAC-ir-CROSS<WC.IC*WC.CC)S 

RCS  -MC-ACCCJ 

FOR  I  THRU  6  DO  KCSlll:WCRlll .  TCS+VCR1II .  RCS* 

TDS:-ALPIIAD*ID-CROSStWP.ID,WD,DD)S 

RDS:-MD*'ACCDS 

FOR  1  THRU  6  DO  KDSlH.WDRIIl .  TDS  +  VDRill .  RDSS 
TES  -ALPHAE*IE-CROSS(WE.IE*WE.EE)S 
RES  -ME’ACCEJ 

FOR  I  THRU  6  DC  .ESill.WERII) .  TES+VERIIl .  RESS 
TFS:-ALPI|AFT1F.CR0SSIWF.IIF*WF,FF)S 
RIS:-MPACCFt 

FOR  I  THRU  6  DO  KFS|I|:WIX|II .  TFS-f  VFRII1 .  RFSt 
/•SUM  AIL  CORRESPONDING  GENERALIZED  FORCES  7 
KKLKASM+KBSIUtKCSIll+KUStlJ+KESM-fKFSMS 
KK2.KASIil  +  KBSl2l  +  KCSl2l  +  KDS[2l+KCSl2l+KFSl2!$ 
KK):KASI3]+K5S|3]+KCSI)|+KDS|3I+KESIJI+KFSI3]J 
KK4:K  ASM)  +  KBS14I  +  KCS|4|  -FKDSI4I  +  KES141 +KFS141S 
KKJ.KASl5l  +  KDS!J]  +  KCSl5l+KDSl5|+KESl5l+KFSf5l$ 

KKt.K  AS|6l  +  KBSIM  +  KCSIM  +  KDSW  +  K.ESI6)  +KFS16IS 
/•STAGE  10.  SET  UP  ACTIVE  FORCES  V 
GA.MATRIXIIO.-G’MA.O)): 

GBMATRIXHO.<i*MB.O]l; 

GC‘-G*MC*MATRIX(IO.CC2.-S21); 

GD:*G*MD*MATRIX(10,CC2,-S21)' 

OE-O*ME‘MATKIX<l0.l.0|)  .RI.R2.R3.R4; 

.  R2  .  R3  .  R4: 


Rl  . 


GF:^)’MPMATRIX(|0.I,01) 

TNA  MATRIXI10.TAUI.0I); 

TBA.MATRIXIITAU2.0.0I): 

TCBMATR1XH0.-SK3MA.0I); 

TDCMATRIX(i0.TAU3.0l>; 

TED.MATRIX1tTAU4.0.0ll; 

TFEMATRIXI10.TAI15.0I); 

RNA:MATRIX(10.0.0|); 

/•SET  UP  GENERALIZED  ACTIVE  FORCES  •/ 

SPEC1AL2IRI:*  BLOCK!  IF  R  -  6  THEN  -SIGMA  ELSE  0); 

KTOTALR(Rl:“SPEClAL2lR| +WARIR1 .  TNA+(WAR|R| .  R2-WBR1RI) .  TBA 
+  1WCR1R1  .  R3-WDR1RI) .  TDC .  R3+(WDR(R| .  R4-WERIR1) .  TED  R4 
+  IWER1R1 .  R5-WFR1RI) .  TFE .  R5  fVBRlRl .  GB+VCRlR)  .OC+VDRIRl .  GD+ VERlR) 
KEEPFLOATiTRUE; 

/•NUMERICAL  EXAMPLE  WITH  VALUE!  GIVEN  IN  REF.  (91V 
Oft  I:  LIU:  L2:.«;  L3-.2;  L4-.I;  UO.7;  L6O.06;  MA:9;  MB:*;  MC:4;  MD;I; 

Mb:0  6;  MF.0.5:  IAIO.01;  IA2.O.02;  IA3O  0I;  IBW.06;  IB2D.0I;  183:0.05; 

IClfl.4;  10.001;  10.0.4.  IDIO.OOOJ;  ID2000I:  ID30.001;  IEI0.0005: 

IE200002;  IF30.0005;  IIFIOOOI;  IIF2-0.002;  IIF3OC03; 
EV(FT:(T-lQ/(2*%P1)*SIN(2*WrT/10)),(%Pt/180),NUMER); 

TQI*0/IO*FT; 

TQ2.WPi/2+(M-90)/l0*FT; 

TQ3:TQI; 

TQ4:TQI; 

TQ5-TQI; 

TQ&1/I0; 

UUl.DIFF(Tt)I.TI*SINtT02)*SIN(TQ3)+niFF(TQ2,T)*CGS(TQ3); 
Ul21.D1FF(Tgl.T)‘COStT02)  t-DIFF<TQ3.T); 
Ul31;DlFFCrgi,r)'SIN(TO2)*COS(TQ3)+DIFF(T0U),SIN(T03); 

U|4l:DIFF(T04.T); 

U|51:DIFF(TQ5.T); 

Ul*l:DIFF(TQt.T); 

QI:TQI; 

QJ.TQ2; 

C*  ’Q3; 

L  -  i  4; 

*  *  S; 

X! 

/•  NOW  WF.  PLOT  RESULTS  AND  COMPARE  WITH  REF.  (91 V 
FINAL*  KTOTALRW  +  SKJMA+KK*; 

FIHAU;EVIF1NAL*.DIFF)S 
EQUALSCALEFAI SE; 


PLOTNUMHO; 

PlOT(F1NAL6.T.O,IO,*PLOT  OF  SIGMA  *). 

F1NAL5;EV(RKS.DIFF)» 

PLOT(FINAU.T.O.IO."?LOT  OFTAU  S*); 
F1NAL4:EVIKK4+KTOTALR|4|+TaU4.DIFF>» 

PLOT  (FIN  AL4.T.0.10 ‘PLOT  OF*  TAU  4*1: 

FlNALJ.EVIKK2*KTC»rALRM  4  I  AtIJ.DIITIS 
PLOr(FiNAL3.T,O.IO,*PLOr  OF  TAUJ  *): 

/TRY  TO  SIMPLIFY  AND  COLLECT  TERMS  X  I J  IN  EQUATION  OF  MOTION*/ 
/‘FIRST  DELETE  NUMERICAL  VALUES  V 

FOR  1:1  QRU 1 00 1  FOR  3:1  QRII  t  DO  XXAlUl.R  ATC0EFF(KAS|II.I1IFI1UUI.TH): 
FOR  1:1  QRU  t  IX)  ( FOR  3:1  QRU  6 IX)  XXBtl.J|:RATCOEFFTKBStl!.UIFFTUUI.TI>l; 
FOR  1:1  QRU  S  DO  ( FOR  5:1  QRII  6 IX)  XXCll.3:  L.vTCOEFFTKCSIll.DIFFlUlJ)  Till; 
FOR  1:1  QRU  i  DO  1  FOR  1:1  QXU  6 IX)  XXDlUl:RAfCOF4>(KDSII|.l)IFi!UlJI.Tli); 
FOR  1:1  QRU  *  DO  (  FOR  3:1  QPU  6 IX)  XXEIUl:RATCOEFFIKES!ll.DIFFTUU).T»>: 
FOR  1:1  QRU  *  DO  (  FOR  1:1  QRU  « IN)  XXFIi.J)  RAYCOEFFlKFSlII.DIFFHJUl.T)!); 
FOR  1:1  QRU  *  DO  (FUR  3:1  QRU  *  IX)  XXXIU|:RaTS1M;’IXXA|U1+XXR!UI  + 
XXCIU1+XXDIIJI+XXE1IJJ+XXF1I.JI)): 

FOR  1:1  THRU  i  DO  (  FOR  3:1  THRU  6 IX)  (.DISPLAY  (XXXll.HI): 

/•COMPARE  ABOVE  X.lj  WITH  THOSE  OF  REF.  191*7 
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APPENDIX  IX 


/•SPACECRAFT  EXAMPLE*/ 

/• . CARTESIAN  l)IV  AND  CURL  DEFINITION- . 

. UNIT  VECTORS  ARE  81  B2  B3 . SEE  LEVINSON  */  . 

/* . DEFINE  DOT  AND  CF.OS3  PRODUCTS.  .*/ 

DOT(VI.V2):-BLOCK(|P.PP). 

FOR  1:1  THRU  J  DO  PM:RATCOEFF<VI,Blll». 

FOR  1:1  THRU  J  DO  PPll|:RA7COEFF(V2,flll)>. 

P|4|:SUM(P|II*PPII|, 1,1.31. 

RETURN(P|4|»S 

CROSSIVI.V2>:-BLOCK<|P.PP.PPPi. 

FOR  1:1  THRU  3  DO  PlIl  RATCOEFl  IVI.BlII). 

FOR  1:1  THRU  3  DO  PP(II:RATCO£FF<V2,Bim, 

PPF|I|:(P|2I,P?13|.P01,PPUI). 

PPP|2|:<-P|I)*PP|3I+P|3I*PPIII). 

PPP|3I:IP1U*PP|2I-PI2|*PP|I|). 

PPP|41.BII|*PPPII.I+8I2|‘PPP|2|+8I3I*PPPI3), 

RETURN!PPP|4|))S 

/• . NOW  WE  INPUT  EQUATIONS  FROM  LEVINSON  S  PAPER  •/ 

DEPENDSIU.TI: 

DEPENDS(Q.T). 

Rl2ICOS(Q)*BI2|-FSIN(Q)*al3l: 

RI3l:-SIN(Q>*Bl2l+COS(Qi*8|31; 

WB,tl|  I|*B|I  I + UI2|*R|2I  -*■  U|3|”B13|; 

QERIVABBREViTRUE: 

Ui4|.OIFF(Q.T>: 

WR:(Ullf+U|4l)*B|II-f  U!2)‘BI2)+U131*Bi3): 

ALPBDIFF(UII|.T),BlH+OIFF<UI2I.T),Bl2l+D:FF(UI3i.T)‘B!3l: 

ALFR:DIFF(WR.TI+CROSSlWB.WR); 

PPItS:Bl*Bll!+B2*RI21+B3*8|3|: 

PPXS:R!,B!li+R2*Ri2l+R3"Rl3l; 

PRSBS:PPft»PPRS; 

VBS:U|5I*3[I  l+UWBUi  +  U|7I*B|3I; 

VRS.VBS  FDWIPRSP^.T).  .lOSSIWR  PRSBS); 

ABSPIFtt’-'EV'l+CFO^. 

AR5  DIFXv  AV  +CR-.:« '  *k  Vt,-,; 

IBBSWB.3ET|,Blli,DOT(»lll.W'1».+8ET2'BI2l'DOTtll2l,WB)  +  BET3,BI3l,DOT(BI3!.wa); 
IRRSWRRIIOI*B:II*DOT«I!,.WR,+RHO  ’Bl2i*DaT<ll2l.WR>+RIK)3'Bl3l*DOTIBU!.WR>: 
IBB$ALPIFRFm*B|l|*DOT(Bill.ALPBH  «  T2*Bl2l*DOT»Bl2I.ALPB)+BET3*BDi*OOTlBI3j.ALPB): 
IRRSALPR:RIIOI*BIII*tX3T(Blll.ALPRI-FRH0}*R|»’00T(Bl2l.ALPR)-FRIIO3'BDJ*DOT(BI3I.ALPRI: 
FSB'.MD'ARS: 

FSR:-MR'ARS: 

TSBCROSS(IBSSWB,W8)-IBBSALPB; 

TSRCROSSdRRSWR.WRMARSALPR; 

F'JF',BHl+F2,B(2)+F3,Bl3): 

TB.TI,i|S|+Ti,BI2|-»T3*BlJI: 

FIR:  ’DOT(DIFFIVBS.U!Rll.FE)+DOT(DIFF<WB.UlRI),TB); 
FS|R|:-DOTIDIFF(VBS.«IKI).F5B>+OOT<DUTIVRS.UIRH,FSR) 
+DOTIDIFF(WS,UlRl).TSB)-FOOT(D.  T(W»,U!Ri),TSR); 

EQHI:-F|RKFSIRI: 

EQIH; 

I  OR  1:1  THRU  7  DO  LDISPLAY  (  Xli  JI*ATCOEFF'(EQll|.DIFFIUIII.T»); 


•!%/* 
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Dynamic  Instability  of  the 
Flexible  Coupler  of  a  Four-Bar  Mechanism 

Iradj  G.  Tadjbakhsh* 

Abstract 

Dynamic  behavior  of  flexible  components  of  mechanisms  is  prone  to 
instabilities  which  create  resonant  speed  barriers.  By  considering  small 
deformations  superimposed  on  the  steady  dynamic  state  equations  governing 
evolution  of  disturbances  can  be  obtained.  For  the  case  of  mechanisms 
driven  by  periodic  inputs  these  equations  reduce  to  a  system  of  coupled 
Mathieu-Hill  equations  for  the  amplitudes  of  modes  of  vibrations. 
Application  of  the  Floquet  theory  determines  the  critical  conditions  of 
speed,  geometry  and  material  properties  causing  dynamic  instability. 

1 .  Introduction 

A  primary  cause  of  instability  in  linkages  is  the  flexibility  of  its 
members.  Under  steady  operations  periodic  time  dependent  axial  forces 
act  on  these  members  that  may  lead  to  instability  characterized  by 
unbounded  growth  of  small  disturbances.  The  situation  is  similar  to 
dynamic  buckling  of  columns  under  the  impact  of  periodic  time  dependent 
loads  [1-2].  Some  recent  investigations  into  dynamic  stability  of  flexi¬ 
ble  mechanisms  is  reported  in  references  [3-5], 

In  this  paper  we  consider  the  dynamic  instability  of  the  coupler  of 
a  four-bar  mechanism  with  equal  input  and  output  crank  lengths.  This 
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Mx  -  R  e  x  Fi  «  Ji  0  k 


(4) 


M2  -  R  e  x  F2  *  J2  8  k 


(5) 


K 


in  which  Fj  and  F2  are  the  end  forces  in  the  coupler,  k  *  i  x  j,  and 

^  a  ^ 

Mj  (■  Mj  k)  and  M2  (*  M2  k)  the  externally  applied  couples  on  the  wheels. 

A  A  A 

Also  Ji,  J2  stand  for  the  mass  moments  of  inertia  of  the  wheels,  Y  (*pA) 
and  l  the  mass  per  unit  length  and  the  total  length  of  the  coupler  and 
e  *  cos  0  .  i  +  sin  0  .  j  the  unit  vector  along  OA. 

a>  e. 

Eqs.  (2)-(4)  constitute  five  scalar  equations  for  the  determination 
of  the  four  unknowns  comprised  of  the  components  of  Fi  and  F2.  Thus  we 
find  for  Fi 


Fi  *  (cot  0  .  $  - 


Mi  -  Ji  0 


R  sin  0  '  * 


•>  i  +  *  j 


(6) 


1 ft 
8 


where 


♦  »  y  R  i  Y(“8^sin0+0co80)  ~  &  (0^cos0+0sin0)  /*  v  ds 


+  T  ^0  (1  "  f5  *  ds 


This  result  is  valid  provided 


t  — 


( J 1  +  J2  +  R  i  Y)  0  +  Y  cos  0  /  v  ds  *  Mi  +  M2 

0 


(7) 


(8) 


which  is  also  a  consequence  of  (2)-(5). 

the  stress  resultant  f  the  axial  force  T  and  the  transverse  shear  N 
in  the  coupler,  at  a  point  s,  are  given  by 


Restricting  subsequent  considerations  to  the  steady  case,  0*0, 

$  *  o»t,  we  select  0  as  the  independent  variable  replacing  t.  Then  (14) 
becomes 

EIv  ~p I(i)2 vgg+yoi^Y  cos0  /*V  ds  +  7  /* s  vgg  ds  -  /*Vgg  ds 

*  rt  *  A  A  ^ 


JL 

-[R(j  -  s)cos0  +  ^-g]v‘)«  -  YW2R(j  -  s)cos0 


(15) 


The  boundary  conditions  that  accompany  (15)  are 


v(O,0)  -  v(t,0)  -  0 


v"(O,0)  -  v"(t,0)  -  0 


(16) 


We  shall  attempt  to  obtain  the  solution  for  y  for  a  given  constant  input 
moment  Mj.  The  output  moment  M2  is  then  obtained  from  (8). 

3 .  Approxirate  Solution 


Employing  Galerkin  procedure  we  assume 


N 


v  »  l  8^(0)  sin  — 
m*l  1 


(17) 


which  satisfies  (16).  Substitution  into  (15)  and  minimizing  the  error 
leads  to  the  solution  of  the  system 


£  +  I  <5  +  V  &  +  2wr2cos 0  E  «j  -  2yr2  sin0.f 


(18) 


Here  g  -  (gx,  g2,  ...  gfl)*,  y  -  R/Jt,  01  *  M^/REI,  X  -  PJt2w2/E  and 

a. 

r  *  i(A/I)*/2  £S  the  parameter  defining  the  slenderness  of  the  coupler. 

The  matrices  C  and  D  are  diagonal  with  elements  C-™.  *  1  +  r2(m*)“2, 
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Dam  *  (mir)2.  The  matrix  I  is  the  unit  matrix  and  E  is  a  symmetric  zero- 

-  ~ 

diagonal  matrix  with  elements 


Eon  -  (1  "  (-D®^) 

r2(m2-n2)2 


Finally,  components  of  the  vector  f  are  given  by 


!-(-!)* 

(m*)2 


m  ■  1,2,3  ...  N 


Limits  of  stability  of  solutions  g  of  (18)  are  determined  by  the  be- 
havior  of  the  solutions  to  the  homogeneous  form  of  (18)  which  is  written 
in  the  form 


sin0  C  g"  +  X“l(sin8.D  +  m^l)g  +  pr2  sin20  E  g  ■  0 


It  can  be  shown  that  system  of  differential  equations  with  periodic 
coefficients  of  period  2 t  admit  of  solutions  of  periods  It  and  4x  [6] • 
More  precisely,  in  the  case  of  (21),  the  regions  of  dynamic  instability 
in  the  four-dimensional  parameter  space  of  X,  mj,  r  and  p  are  confined  by 
surfaces  S(X,  mj,  r,  p)  *  0  for  which  periodic  solutions  of  the  same 
period  exist.  Regions  bounded  by  surfaces  which  correspond  to  solutions 
of  different  periods  contain  parameter  values  of  stable  solutions  [2]. 

With  a  view  to  constructing  solutions  of  period  2ir  we  set 
N 

g  *  7  £o  +  I  £k  cos  k9  +  Bfc  sin  k©  (22) 

k-1 


The  constants  Aq,  A^,  B^,  k  -  1,2,3,  ...,  can  be  determined  by  substitut- 

A 

ing  (22)  into  (21)  and,  in  the  resulting  equations,  equating  coefficients 

of  various  Fourier  components  to  zero.  In  this  way  a  system  of  algebraic 

equations  are  obtained  the  first  three  of  which  have  special  forms  and 
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thereafter  nay  be  given  with  a  general  formula.  These  are: 


aj  Aj  +  (D  •  i  C)  Bi  +  yr2  X  &  *2  *  0 

e#  A  A  A  A  A 

2m]  Ai  +  yr2  A  E  B]  +  (D  -  4X  C)  B2  +  yr2  E  B3  ■  0 

A  A  A  A  A  A  A  A 

D  ^  +  yr2  X  E  Ai  +  2ml  B],  +  (4XC  -  D)  A2  -  yr2  X  E  A3  «  0 

A  A  A  A  A  A  A  A  A  A 

-  u r2  X  E  Bjc-2  +  [(k-1)2  X  C  -  D]  Bk-i  +  2mi  Afc  + 

A  A  A  A  A  A 

[D  -  (k+1)2  X  C]  Bic+i  +  yr2  X  E  3k+2  *  0,  k  -  2,3,.... N 

A  A  A  A  A 

yr2  X  E  Afc-2  +  U>  “  (k-1)2  XC]  Afc-i  +  2mi  Bk  +  (23) 

A  A  A  A  A  A 

[(k+1)2  X  C  -  D]  Av+l  “  yr2  X  E  Afc+2  *  0,  k  »  2,3,....N 

Existance  of  a  solution  to  the  homogeneous  system  (23)  requires 
vanishing  of  an  infinite  determinant.  In  order  to  study  the  problem 
approximately  but  in  a  systematic  way,  we  use  a  hierarchy  of  subsystem  of 
increasing  order  in  the  number  of  unknowns.  As  the  smallest  subsystem  we 
consider  the  case  when  the  only  non-zero  elements  are  Ag  »  [Agi  ] ,  A]  » 

A  A 

[An],  Bi  »  [Bll],  Then  by  considering  only  the  three  leading  equations 
in  the  set  (23)  we  obtain  An  »  0  and 


®1  Aqi  +  (Du  ”  *  Cn)  Bn  »  0 


D11  AOl  +  2ol  B11  *  0 


(24) 


This  implies 


2m2 

] 


14  -  (x2  +  r2)  X 


(25) 


which  is  a  parabola  in  the  X-mi  plane.  The  next  larger  and  more  accurate 
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have  a  special  form  and  thereafter  may  be  .given  by  general  formulas* 
These  are: 

8mi  A!  +  (4D  -  1C)  Bi  +  [<4|i  r2  E  -  9C)  X  +  4D]  B3 
+  4y  r2  X  E  B5  *  0 

(4D  -  XC)  Ax  +  8mi  Bi  +  f (9C  +  4y  r2  E)  X  -  4DJ  A3 
-  4y  r2  E  A5  -  0 

[(C  +  4y  r2  E)X  -  4D]Bi  +  8mi  A3  +  (4D  -  25X  C)Bs  +  4y  r2  X  E  B7  -  0 
[(4D  -  XC)  +  4jir2XE]  Aj.  +  8mi  B3  +  (25X  C  -  4D)  A5  -  4yr2  X  E  A7  -  0 

<V  A  A  ^  ^  ^  ^  ^  ^  ^ 

-  4y  r2  X  E  Bk-4  +  [X(k-2)2  C  -  4D]  Bk-2  +  3*1  *k  +  (31) 

<s  a.  <s  ^  ^ 

[4D  -  X(k+2)2  Cl  Bfc-2  +  4y  r2  X  E  Bk+4  -  0f  k  »  5,7,9,  ... 

A  A  ^  ^ 

4y  r2  X  E  Afc-4  +  [4D  -  X(k-2)2  C]  Afc-2  +  8«i  Bk  + 

»%  ^  A  ^  ^  ^ 

ttk+2)2  X  C  -  4D]  Afc+2  -  4y  r2  X  E  Afc+4  -  0,  k  -  5,7,9,  ... 


Corresponding  to  a  subsystem  consisting  of  only  Ajj,  Bn  A31,  B31 
with  all  others  being  zero,  one  obtains  t  te  characteristic  equation 


64  mi  -  (4Dn  -  9X  Cn)(4Dn  -  X  Cn)  "  ±  8mi(4Dn  -  X  Cn) 


(31) 


Fig.  3  shows  the  results  of  the  analysis  in  the  X-mi  plane.  These 
results  are  based  on  formulas  (28)-(29)  and  (31).  Boundaries  of  the 
regions  of  instability,  shown  as  shaded,  are  solid  lines  when  based  upon 
2  it -periodic  solution  and  dashed  lines  when  based  upon  4ir-periodic  solu¬ 
tions.  The  values  of  y  *  0.4  and  r  *  100  were  used. 
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AN  INTRODUCTION  TO  THE  SCIENTIFIC  COMPUTING 
LANGUAGE  PASCAL-SC 


L.  B.  Rail 

Mathematics  Research  Center 
University  of  Wisconsin-Madison 


ABSTRACT.  Microcomputers  are  now  being  widely  used  for  small-scale 
scientific,  engineering,  and  statistical  computing.  Pascal-SC  (Pascal  for 
Scientific  Computing)  is  a.  language  which  has  been  developed  specifically  for 
this  application.  Its  most  important  features  are:  (i)  accurate  floating-point 
arithmetic  for  real,  complex,  and  interval  numbers,  vectors,  and  matrices,  with 
controlled  rounding  if  desirec;  (ii)  the  convenience  of  operator  notation  for 
numerical  data  types,  which  nukes  programs  easier  to  write,  read,  and  document, 
together  with  the  ability  to  accept  user-defined  operators  for  nonstandard  data 
types;  and  (iii)  compatibility  with  ordinary  Pascal,  so  that  Pascal  programming 
techniques  and  programs  already  written  in  Pascal  can  be  used  immediately.  In 
Pascal-SC,  solutions  of  linear  systems  of  equations,  inverses  of  matricos,  and 
eigenvalues  and  eigenvectors  are  computed  with  guaranteed  error  bounds,  and 
scalar  products  of  vectors  and  sums  of  arbitrary  length  of  floating-point 
numbers  are  confuted  to  the  closest  floating-point  number,  or  rounded  as 
desired.  These  basic  features  of  Pascal-SC  will  be  described,  together  with 
applications  to  research  on  numerical  methods  which  have  been  carried  out  on  a 
microcomputer . 

1.  OBJECTIVE.  A  brief  description  of  the  language  Pascal-SC  (Pascal  for 
Scientific  Computation)  will  be  given  to  explain  features  of  this  extension  of 
ordinary  Pascal  [10]  which  are  particularly  useful  for  scientific,  engineering, 
and  statistical  calculations,  particularly  on  microconf uters .  The  reader  is 
assumed  to  be  familiar  with  Pascal  programming  on  at  least  an  introductory 
level,  and  to  have  had  some  experience  with  numerical  confutation  of  the  kind 
which  arises  in  scientific  and  engineering  problems.  With  this  background,  it 
should  be  easy  to  appreciate  the  advantages  of  the  additional  features  of 
Pascal-SC. 

In  order  to  keep  the  discussion  short  and  to  the  point,  many  details  will 
be  omitted,  and  a  formal  description  of  the  language  will  not  be  given.  For 
operational  and  programming  details,  the  reader  should  consult  [3],  [19],  [28); 
formalities  are  given  in  [5].  Here,  simple  exanfles  will  be  used  to  illustrate 
ideas  as  they  are  introduced. 

2.  WHY  PASCAL-SC?  As  pointed  out  by  Wirth  [10],  the  introduction  of  a  new 
computer  language  requires  careful  justification.  The  same  applies  to  an 
extension  n  modification  of  an  existing  language,  particularly  a  language  which 
is  as  successful  and  widely  used  as  Pascal.  The  most  important  additional 
features  of  Pascal-SC  are: 
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(i)  Accurate  floating-point  arithkatio  wit h  controllable  rounding; 


(ii)  User-defined  operators  to  facilitate  programming  and  documentation . 

Furthermore,  it  is  of  considerable  importance  to  note  that: 

(iii)  Pascal-SC  retains  the  features  of  ordinary  Par  al. 

Thus,  none  of  the  investment  in  learning  to  program  in  Pascal  or  in 
programs  already  written  in  Pascal  is  lost  in  going  from  Pascal  to  Pascal-SC- 
Tie  Pascal-SC  compiler  can  translate  programs  written  in  ordinary  Pascal* 
Moreover,  programming  in  Pascal-SC  will  come  very  naturally  to  the’  Pascal 
programmer,  as  will  be  seen  from  the  examples  given  below* 

With  regard  to  (i),  the  floating-point  arithmetic  provided  by  Pascal-SC  is 
implemented  not  only  for  real  floating-point  numbers  (type  REAL),  but  also  for 
complex  numbers,  intervals,  and  vectors  and  matrices  of  these  types.  This 
allows  convenient  and  accurate  computation  with  the  kinds  of  numerical  data  most 
frequently  encountered  in  scientific  and  engineering  problems.  Tne  floating¬ 
point  arithmetic  of  Pascal-SC  [28]  is  constructed  on  the  basis  of  the  theory  of 
computer  arithmetic  given  by  Kulisch  and  Mil  anker  [14],  which  guarantees 
accuracy,  controllability  and  reliability  of  the  results.  In  order  to  keep  the 
compiler  small  enough  to  be  convenient  to  use  on  microcomputers  and  still 
provide  these  additional  features,  extensive  use  is  made  of  external  libraries 
of  pretranslated  code  which  the  compiler  can  link  to  the  user’s  program,  or 
source  code  which  can  be  compiled  as  part  of  it. 

The  second  important  additional  capability  of  Pascal-SC  is  that  it  allows 
user-defined  operators  to  permit  the  manipulation  of  nonstandard  data  types  in 
ordinary  mathematical  notation.  For  example,  if  A  is  a  matrix,  and  x,  b,  c  are 
vectors,  then  the  programmer  can  write  the  statement 

(2.1)  c  :=  A*x  +  bj 

in  Pascal-SC  to  perform  the  indicated  calculations.  This  notation  follows 
oxdinary  mathematical  usage,  and  thus  has  the  advantages  of  clarity  and 
simplicity  compared  to  the  calling  of  procedures  and  functions  to  obtain  the 
same  result  in  ordinary  Pascal.  In  order  for  the  Pascal-SC  compiler  to  accept 

(2.1) ,  the  heading  of  the  program  has  to  contain  a  definition  of  the  binary 
operators  *  to  perform  matrix  by  vector  multiplication  and  +  to  perform  vector 
addition.  (Source  code  for  these  operators  is  included  in  the  Pascal-SC  package 
for  vector  and  matrix  arithmetic. )  In  addition  to  this  "overloading"  of 
standard  operator  symbols,  Pascal-SC  permits  the  user  to  give  operators 
arbitrary  names  (for  example,  XOR  for  "exclusive  or"),  and  assign  priorities  to 
such  operators.  One  particular  convenience  of  Pascal-SC  is  that  the  operator  ** 
can  be  defined  to  perform  exponentiation,  which  makes  the  w-iting  of  polynomials 
and  other  functions  containing  powers  simpler  than  in  ordi.  *y  Pascal.  In 
allowing  user-defined  operators,  Pascal-SC  is  similar  to  Ai  68  and  Ada. 

These  points  will  be  discussed  in  more  detail  in  the  following  sections. 

3.  FLOATING-POINT  REAL  ARITHMETIC.  This  is  the  "built-in"  arithmetic  of 
Pascal-SC  for  floating-point  numbers  (type  REAL).  Since  this  arithmetic  is 
based  on  the  general  theory  of  computer  arithmetic  given  in  [14],  it  is 
accurate,  controllable,  and  reliable.  Before  going  on  to  details,  a  precise 
statement  of  the  meaning  of  these  terms  is  necessary  because  the  related 
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concepts  of  "accuracy"  (the  exactness  with  which  results  are  calculated)  and 
"precision"  (the  number  of  digits  used  in  the  representation  of  floating-point 
numbers)  are  often  confused.  For  example,  the  result  32.0  -  31.0  =  1.00  is 
calculated  with  low  precision  (3  decimal  digits),  but  high  accuracy  (exactly). 

By  contrast,  UNVAC  1100  floating-point  hardware  gives 

134217728.0  -  134217727.0  =  2.00000000, 

which  is  done  with  higher  precision  (9  significant  digits),  but  no  accuracy, 
since  the  correct  answer  is  1.90000000  [21]. 

It  is  possible  to  discuss  the  idea,  of  accuracy  independently  of  the 
particular  precision  used,  since  each  floating-point  system  contains  only  a 
finite  set  of  numbers.  In  a  given  system  S,  two  floating-point  numbers  u,v  with 
u  <  v  will  be  said  to  be  adjacent  if  there  is  no  floating-point  number  w  such 
that  u  <  w  <  v.  For  x,y  €  S,  the  exact  result  xoy  of  ;.r>  arithmetic  operation  o, 
where  o  €  {+,  -,  *,  /},  will  either  be  a  floating-point  number,  or  a  real  number 
w  such  that  u  <  w  <  v,  where  u,v  are  adjacent  f loatim-point  numbers.  In  order 
to  produce  a  floating-point  number  in  the  latter  case,  is  "rounded"  to  an 
element  of  S,  which  should  be  either  u  or  v.  This  is  the  basic  requirement  for 
reliability  of  a  floating-point  arithmetic  operation.  The  floating-point 
software  supplied  for  some  microcomputers  does  not  meet  this  simple  requirement, 
and  neither  does  the  floating-point  hardware  of  some  mainframe  computers.  This 
unhealthy  situation  has  lead  the  IEEE  to  undertake  the  promulgation  of  standards 
for  floating-point  arithmetic  (see  the  SXGNUM  Newsletter  for  October,  1979). 

An  accurate  rounding  R  of  the  floating-point  operation  o  selects  R(xoy) 
equal  to  u  or  v  to  minimize  the  roundoff  error 

(3.1)  e(xoy)  =  | R(xOy)  -  (xOy)|. 

This  is  the  best  possible  answer  (BPA)  rounding  [29] .  In  case  of  a  tie  between 
u  and  v,  a  suitable  rule  is  invoked.  In  I.-scal-SC,  this  rounding  is  to  the  one 
of  u,v  which  is  furthest  from  zero,  in  order  to  satisfy  the  condition  of 
antisyanetry  of  R,  R(-x)  =  -  R(x),  which  is  required  by  the  general  theory  [14]. 

The  distance  ju  ■■  v|  between  u  and  v  will  depend  on  the  precision  of  the 
floating-point  numbers  being  used;  this  determines  the  maximum  roundoff  error  of 
a  reliable  calculation.  Of  course,  if  the  arithmetic  of  the  machine  being  irsed 
is  rot  reliable,  then  roundoff  error  is  not  related  in  a  simple  way  to 
precision,  and  che  attempt  to  "buy"  more  accuracy  by  using  increased  precision 
can  be  futile,  as  well  as  expensive. 

Thore  are  other  ways  in  which  rounding  to  a  reliable  result  can  be  carried 
out,  including: 

(i)  x  is  rounded  upward  to  v; 

(ii)  x  is  rounded  downward  to  u. 

The  directed  roundings  (i)  and  (ii)  are  necessary  to  support  interval 
arithmetic  [17],  [16],  [14],  among  other  things.  Rounding  in  a  floating-point 
arithmetic  is  said  to  be  controlled  if  the  user  can  choose  the  metho^  r  •'.sired 
for  the  result  of  a  given  floating-point  arithmetic  operation.  Pascal-SC  gives 
the  user  the  choice  of  BPA  and  the  upward  and  downward  directed  roundings,  which 
means  that  a  total  of  twelve  operators  are  provided  for  the  four  basic 
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arithmetic  operations  +/  -,  *,  / ,  and  the  three  roundings  listed  above: 


+  -  * 

/ 

-  {  BPA  rounding-  } 

(3.2) 

+>  ->  *> 

/> 

{  Upward  rounding  } 

+'.  -<  *< 

/< 

{  Downward  rounding  } 

Thus,  the  Pascal-SC  programmer  can  control  the  direction  of  rour.ding  if  desired, 
for  example,  to  obtain  guaranteed  lower  or  upper  bounds  for  the  values  of 
arithmetic  expressions  [6] .  It  also  follows  from  the  reliability  of  Pascal-SC 
arithmetic  that  the  addition  and  multiplication  operators  (3.2)  are  commutative, 
which  is  not  true  for  the  kind  of  floating-point  arithmetic  ordinarily 
encountered. 

For  the  microcomputer  implementations  of  Pascal-SC,  decimal  arithmetic  is 
used,  and  the  precision  of  floating-point  numbers  is  twelve  decimal  digits  in 
scientific  notation,  with  an  exponent  range  in  powers  of  10  from  -99  to  +99. 

The  smallest  and  largest  positive  numbers  are  thus  MINREAL  -  1.00000000000E-99 

(=  10*")  and  MAXREAL  =  9. 99999999999E+99,  respectively.  Zero  is  represented  by 
0  =  0. 00000000000E+00,  as  usual  [28].  The  use  of  dec5.mal  arithmetic  avoids  the 
errors  introduced  by  conversion  between  binary  and  decimal  upon  input  and 
output.  Decimal  values  are  represented  internally  by  two  BCD  digits  per  byte. 
The  precision  of  the  Pascal-SC  floating-point  number  system  for  microcomputers 
is  thus  adequate  for  the  representation  of  most  numerical  quantities  of  interest 
in  scientific  and  engineering  computation.  Furthermore,  since  Pascal-SC 
floating-point  arithmetic  is  reliable,  accurate,  and  controllable,  there  is 
seldom  any  need  for  more  than  12  decimal  digits  of  precision  in  a  given 
computation. 

To  illustrate  the  features  of  Pascal-SC  arithmetic,  consider  the  product 

(3.3)  A  **  (13. 4565432278) (0.000453782392 145). 

The  exact  value  of  A  is  not  a  12-digit  floating-point  number,  so  rounding  will 
take  place  in  the  corresponding  floating-point  multiplication  operations.  The 
floating-point  operation  *  gives 

(3.4)  B  =  1 . 34565432278U+0 1  *  4. 53782392145E-04  =  6. 10634237591E-03, 

as  the  BPA  for  A,  which  is  in  error  by  at  most  ~  =  5.0x10*^,  since  Pascal-SC 
arithmetic  is  reliable  and  the  distance  between  B  and  its  two  neighboring 
floating-point  numbers  is  6  =  10”14.  More  precisely,  (3.4)  establishes  that  A 

f  f 

belongs  to  the  half-open  interval  [B  -  .2.,  b  +  -|) ,  since  ties  are  rounded  away 
from  0.  The  operations  *<  and  *>  with  directed  rounding  give 

C  =  1 . 34565432278E+0 1  *<  4. 53782392145E-04  =  6. 10634237591E-03, 

(3.5) 

D  =  1 . 34565432278E+0 1  *>  4. 53782392145E-04  =  6. 10634237592E-03, 

respectively.  Since  C  =  B,  the  result  (3.5)  shows  that  the  exact  answer  A 
belongs  to  the  interval  [C,D]  =  iB,D]  =  [B,  B  +  6];  therefore,  on  the  basis  of 


(3.4),  A  belongs  to  the  half-open  interval  [B  '  B  +  |)  fl  tB,  B  +  6]  = 

[B,  B  +  ~)  (see  Figure  3.1).  Thus,  this  calculation  proves  that  A  >_  B  and 

A  -  B  <  S.OxlO"1^.  This  gives  a  more  accurate  location  for  A  than  the  BPA 
answer  (3.4). 


B  -  |  B=C  B  +  -|  D  D+| 


Figure  3.1.  A  Detail  of  the  REAL  Floating-Point  Screen. 

In  addition  to  the  twelve  rounded  arithmetic  operations  listed  above, 
Pascal-SC  provides  the  most  frequently  used  standard  functions,  computed  to  BPA 
accuracy  [28] .  In  order  to  keep  the  microcomputer  version  of  the  Pascal-SC 
system  small,  extensive  use  is  made  of  external  libraries,  so  the  congpiler  will 
bring  in  code  only  for  functions,  procedures,  and  operators  actually  needed  by 
the  program. 

5.  FLOATING-POINT  COMPLEX  ARITHMETIC.  In  Pascal-SC,  manipulation  of 
complex  numbers  is  accomplished  by  subroutines  for  operators,  functions,  and 
procedures  which  are  stored  in  external  libraries.  For  this  reason,  the 
declaration  of  complex  numbers  has  the  stereotyped  form 

TYPE  COMPLEX  =  RECORD  RE, IM:  REAL  END; 

Thus,  the  representation  of  a  complex  number  2  in  Pascal-SC  is  in  Cartesian 
coordinates.  For  input  and  output  of  Z,  the  standard  format  is  (Z.RE,  Z.IM). 

In  ordinary  Pascal,  addition  and  other  arithmetic  operations  with  conplex 
numbers  have  to  be  done  by  procedures  and  functions.  In  Pascal-SC.  however, 
operator  overloading  simplifies  notation  in  the  program  considerably.  To 
illustrate  this,  consider  as  a  single  example  the  source  code  to  enable  the 
operator  +  to  add  complex  numbers: 

OPERATOR  +  (A,B:  COMPLEX)  RES:  COMPLEX; 

VAR  U:  COMPLEX; 

BEGIN 

U.RE  :=  A. RE  +  B.RE; 

U. IM  :=  A. IM  +  B.IM; 

RES  :=  U 
END; 

The  actual  coding  for  this  operator  is  essentially  the  same  as  for  a 
function  or  procedure  for  the  same  purpose.  However,  to  add  the  two  complex 
numbers  V,W  and  assign  the  result  to  the  complex  number  Z,  one  writes  only 

(5.1)  2  :=  V  +  W; 

in  the  subsequent  program.  Addition  operators  have  to  be  defined  for  all  pairs 
of  operands  of  types  INTEGER,  REAL,  and  COMPLEX,  since  these  would  occur 
naturally  in  expressions  being  evaluated.  If  K,  R,  C  denote  generic  variables 
of  types  INTEGER,  REAL,  and  COMPLEX,  respectively,  then  six.  addition  operators 
are  needed: 


-^-*V  AkAkJ'.k.'M^UI  JT 


(5.2) 


+C,  K  +  C,  C  +  X,  R  +  C,  C  +  R,  C  +  C. 


Similarly,  six  subtraction  operators 


(5.3) 


-C,  K  -  C,  C  -  K,  R  -  C,  C  -  R,  C  -  C, 


are  required,  as  well  as  five  multiplication  and  five  division  operator.'. : 


(5.4) 


K*C,  C*K,  R*C,  C*R,  C*C, 
K/C,  C/K,  R/C,  C/R,  C/C. 


All  22  of  the  operators  (5. 2)- (5. 4)  are  provided  in  an  external  library  in 
the  form  of  pretranslated  code,  and  the  corresponding  declarations,  for  example, 

OPERATOR  +  (RA:  REAL;  B:  COMPLEX)  RES;  COMPLEX; 

EXTERN  L  155? 

are  available  to  the  programmer  in  an  external  text  file.  Actual  coding  is  also 
simplified  considerably  the  fact  that  the  programmer  can  direct  the  compiler  to 
refer  to  externa?  libraries  for  type  declarations  and  definitions  of  operators 
and  other  needed  functions  and  procedures  [19],  [28].  The  use  of  this  feature 
makes  the  source  code  for  a  Pascal-SC  program  more  compact  and  readable. 

Examples  of  programs  using  such  directives  are  given  in  Appendices  A  and  B. 

All  complex  floating-point  operations  in  Pascal-SC  calculate  the  real  and 
imaginary  parts  of  the  result  to  BPA  accuracy.  For  addition  and  subtraction, 
operators  similar  to  the  one  given  above  for  complex  addition  are  satisfactory; 
however,  multiplication  and  division  require  special  algorithms  to  attain  this 
accuracy  [14],  [28].  For  example,  consider  the  function  CDIV  which  does  complex 
division  by  the  usual  formula: 


(5.5) 


FUNCTION  CDIV(A,B  ;  COMPLEX) 
VAR  DENOM:  REAL; 


COMPLEX; 


U;  COMPLEX; 

BEGIN 

DENOM  ;=  B.RE  *  B.RE  +  B.IM  *  B.IM; 

U.RE  ;=  (A. RE  *  B.RE  +  A.IM  *  B.IM)  /  DENOM; 
U.IM  ;=  (A.IM  *  B.RE  -  A. RE  *  B.IM)  /  DENOM; 
CDIV  ;=  U 


The  Pascal-SC  operator  for  complex  division. 


(5.6) 


OPERATOR  /  (A,B:  COMPLEX)  RES;  COMPLEX; 
EXTERNAL  182; 


has  a  number  of  advantages  over  the  function  CDIV.  Among  these  are; 

a.  Accuracy.  The  results  of  Pascal-SC  complex  operations  are  calculated 
to  BPA  accuracy  in  the  sense  that  their  real  and  imaginary  parts  are  given  to 
BPA  accuracy.  Ordinary  Pascal  functions  and  procedures  for  COMPLEX 
multiplication  or  division,  such  as  CDIV  given  above,  cannot  attain  this 
accuracy  because  of  the  number  of  roundoff  errors  which  occur.  In  fact, 
catastrophic  cancellations  can  happen  which  render  the  results  almost 
meaningless  when  the  ordinary  formulas  in  (5.5)  are  used.  For  example,  for 


(5.7) 


V  =  (1.23456789,  1.23456789), 


W  =  (1.0000123E-05,  1.0000321E-05), 


one  gets 


(5.8) 


V/W  =  (1 . 23454048308E+05,  -1 . 222167946i2E+00 ) , 
CDIV(V,W)  =  ( 1 . 23454048308E+05,  -1 .22216773503E+00 ) , 


where  the  incorrect  digits  of  CDIV(U,V)  are  indicated  in  boldface.  Even  in  this 
fairly  harmless-looking  case,  the  algorithm  (5.5)  has  lost  five  significant 
digits  in  the  imaginary  part  of  the  quotient  in  a  single  division,  while  all  the 
digits  of  the  Pascal-SC  result  for  V/W  are  correct.  To  be  sure,  roundoff  error 
will  also  increase  with  repeated  use  of  the  Pascal-SC  division  operator  (5.6), 
but  at  a  slower  and  more  predictable  rate. 

b.  Deferred  overflow.  In  the  Pascal-SC  algorithms  for  complex 
multiplication  and  division,  overflow  does  not  result  unless  the  real  or 
imaginary  part  of  the  result  is  >  MAXREAL  in  absolute  value,  whereas  overflow 
can  occur  in  (5.6)  in  the  calculation  of  the  intermediate  values  DENOM,  U.RE, 
U.IM,  even  though  the  actual  result  has  real  and  imaginary  parts  which  are 
representable  by  floating-point  numbers.  For  example,  for 

(5.9)  V  =  (3.0E+99,  -1.0E+99),  W  =  (1.0E+99,  -1.0E+99), 

Pascal-SC  complex  division  gives  the  result 

(5.10)  V/W  *  (2. OOOOOOOOOOOE+OO,  1 .00000000000E+00) , 

while  the  subroutine  (5.6)  for  CDIV  overflows  when  trying  to  compute  DENOM. 

c.  Ease  of  use.  For  VAR  U,V,W:  COMPLEX,  the  use  of  (5.6)  allows  one  to 

write 


(5.11)  U  :=  V  /  W; 

in  the  source  code  for  the  program  instead  of 

(5.12)  U  :=  CDIV(V,W) ; 

as  in  ordinary  Pascal.  In  the  case  of  complicated  expressions  involving  complex 
numbers,  the  gain  in  programming  ease  using  ordinary  mathematical  notation  with 
operators  instead  of  function  and  procedure  calls  is  significant.  The  source 
code  is  more  likely  to  be  correct  in  the  first  place,  and  also  will  be  easier  to 
document  and  read  later. 


d.  Compilation  time.  The  function  CDIV  has  to  be  compiled  from  the  source 
code  (5.5)  for  each  ordinary  Pascal  program  which  uses  complex  division.  The 
operator  (5.6),  on  the  other  hand,  is  given  by  pretranslated  code  which  is 
automatically  linked  to  the  user's  program  in  the  last  stage  of  the 
compilation.  This  saves  a  considerable  amount  of  compilation  time. 

The  accurate  fcomplex  divison  in  Pascal-SC  turns  out  to  be  slower  them  the 
function  CDIV.  According  to  the  table  given  in  [4],  p.  267,  a  typical  time  for 
the  complex  division  U/V  is  100  milliseconds  for  a  2.5MHz  Z80  processor.  The 
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function  CDIV(U,V)  user  six  real  multiplications/  two  real  divisions/  and  three 
real  addition/subtractions.  The  typical  times  for  these  operations  given  in  [4] 
total  60.4  milliseconds.  It  will  be  seen  later  that  some  Pascal-SC  operations 
are  actually  faster  than  their  inaccurate  real  simulations;  however/  in  the  case 
of  complex  multiplication  and  division,  one  pays  a  little  for  guaranteed, 
reliable  accuracy. 

In  addition  to  the  arithmetic  operators  +,  -,  *,  /  for  type  COMPLEX,  a 
number  of  additional  operators,  functions,-  and  procedures  are  provided  in  the 
Pascal-SC  complex  library  for  convenience.  For  details  on  these,  including  the 
domains  and  ranges  of  the  standard  functions,  see  [23]. 

Rounded  complex  operations  +<,  +>,  -<,  ->,  *<,  *>,  /<,  />  are  also  included 
in  an  external  library  in  the  form  of  pretranslated  code  [28].  Here,  rounding 
is  carried  out  componentwise.  Each  complex  number  z  =  (x,y)  with  |x|,|y|  < 
MAXREAL  will  belong  to  a  rectangle  with  corners  which  are  the  floating-point 
complex  numbers  A  *  (u,v),  B  =  (u  +  6,  v),  C  =  (u  +  6,  v  +  n),  D  =  (u,  v  +  n)» 
and  which  contains  no  other  floating-point  complex  numbers  (see  Figure  5.1). 

The  result  of  rounding  z  downward  will  be  Vz  =  A  =  (u,v),  while  z  is  rounded 
upward  to  Az  =  C  *  (u  +  6,  v  +  n),  where  6  and  n  are  the  spacings  in  the 
floating-point  screen  in  the  horizontal  and  vertical  directions  in  the  complex 
plane,  respectively.  The  BPA  rounding  of  z  will  be  (BPA(x),  BPA(y) ) ,  and  thus 
could  be  any  one  of  the  four  points  A,B,C,D.  For  example,  suppose 

(5.13)  z  =  (100  -  4i)/(565  +  789i), 

which  is  not  a  complex  floating-point  number.  Pascal-SC  operations  give 
A  =  (100,-4)/<(565,789)  =  (5. 66437234668E-02,-C. 61803501 157E-02 ) , 

(5.14) 

C  =  (100,-4)/>(565,789)  =  (5.66437234669E-02,-8.61803501156E-02), 
while  the  BPA  for  (5.13)  is 

(5.15)  D  =  (100,-4)/(565,789)  =  (5.66437234668E-02,-8.61803501156E-02). 


D 


A 


C 


B 


Figure  5.1.  A  Detail  of  the  COMPLEX  Floating-Point  Screen. 

In  this  case,  6  *  n  *  1.0  x  10”^,  and  the  results  (5.14)  and  (5.15)  locate  z  in 

the  rectangular  complex  interval  A  +  [0,  5.0x10”^)  +  i[5.0x10“^,  I.OxlO”1^], 
according  to  the  rules  for  Pascal-SC  rounding  (see  Figure  5.1). 
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Complex  arithmetic  with  directed  rounding  can  be  used  as  the  basis  for 
complex  interval  arithmetic  114). 


6.  FLOATING-POINT  INTERVAL  ARITHMETIC.  Interval  arithmetic  [1]  [17],  [16] 
is  based  on  the  use  of  closed/  finite  intervals  [a,b]  of  real  numbers  as  its 
basic  elements.  Interval  arithmetic  has  a  number  of  significant  applications  in 
scientific,  engineering,  and  statistical  confutation;  however,  its  use  has  not 
been  widespread  up  to  now  because  of  the  limitations  of  conventional  computer 
arithmetic  units  [18]  and  ordinary  programming  languages.  In  Pascal-SC, 
interval  arithmetic  has  been  implemented  efficiently,  and  is  just  as  convenient 
to  use  as  real  or  complex  arithmetic. 


In  the  general  theory  of  computer  arithmetic  [14],  interval  arithmetic  is 
regarded  as  a  special  case  of  arithmetic  on  subsets  of  real  numbers.  Here,  if 
X,  Y  are  subsets  of  R,  and  o  £  {+,  *,  /},  then 


(6. 1 } 


Z  =  X  o  y  =  {xoy  j  x  €  X,  y  €  Y}, 


by  definition.  For  division,  of  course,  0  €  V  is  excluded.  If  X  =  [a,b]  and 
Y  =  [c,d]  are  intervals,  then  Z  =  [r,s]  is  an  interval  if  defined,  and  the 
endpoints  r,s  of  Z  can  be  calculated  from  the  endpoints  of  X,Y  [14],  [17], 
[16].  It  is  assumed,  of  course,  that  the  intervals  X  =  [a,b]  considered  are 
finite  and  proper,  that  is,  a  £  b. 


The  most  basic  application  of  interval  arithmetic  is  the  following:  If  the 
value  of  a  function,  for  example. 


(6.2) 


w  *  9x4  -  y4  +  2y^, 


is  calculated  in  interval  arithmetic  for  intervals  X,Y,  then  the  resulting 
interval  W  will  contain  all  values  w  of  the  function  (6.2)  for  all  values  of  x  € 
X,  y  €  Y.  This  property  of  interval  arithmetic  allows  one  to  bound  the  ranges 
of  functions  without  detailed  analysis  of  maxima  and  minima.  Since  rounding  of 
interval  operations  is  outward  to  the  smallest  floating-point  interval  which 
contains  the  exact  results  [28] ,  automatic  bounds  for  round-off  error  can  be 
obtained  conveniently,  by  taking  the  input  intervals  X,Y  to  be  single  points, 
that  is,  X  =  [x,x],  Y  =  [y,y] ,  [1],  [17],  [16].  For  a  simple  application  of 
this  principle,  consider  the  evaluation  of  (6.2)  by  the  expressions 


(6.3) 


(a)  w  :=  9*x*x*x*x  -  y*y*y*y  +  2*y*y; 

(b)  w  :=  9*(x**4)  -  y**4  +  2*(y**2); 


w  :=  (3*<x**2l  -  y**2)*(3*(x**2)  +  y**2)  +  2*(y**2); 


in  REAL  arithmetic,  and 


W  :=  9*X*X*X«X  -  Y*Y*Y*Y  +  2*Y*Y; 


(6.4) 


W  :=  9*(X**4)  -  Y**4  +  2*(Y**2); 


W  :=  (3*(X**2)  -  Y**2)*(3*(X**2)  +  Y**2)  +  2*(Y**2); 


in  INTERVAL  arithmetic,  respectively.  The  power  operators  **  in  (6.3)  and  (6.4) 
are  given  in  Appendix  A  in  the  source  code  for  the  program  WEVAL  which  was  used 
to  calculate  the  following  results. 


For  x  =  10864,  y  =  18817,  the  statements  (6.3)  give 


For  x  =  10864,  y  =  18817,  the  statements  (6.3)  give 

(a)  w  =  1.58978  x  10s,  w  =  [-1.841022  x  106,  1.158978  x  106] , 

(6.5)  (b)  w  =  -8.41022  x  105,  W  =  [-8.410220  x  105,  1.158978  x  106], 

(c)  w  =  1.00000000000,  W  =  [  1.00000000000  ,  1.00000000000  ]. 

The  enormous  width  of  V?  in  (6. 5) (a)  and  (6.5) (b)  indicates  that  the  values  given 
for  w  can  be  subject  to  large  roundoff  error,  and  hence  are  untrustworthy.  On 
the  other  hand,  evaluation  of  (6.2)  by  the  statements  (6.3)(c)  and  (6.4)(c), 
respectively,  shows  that  this  formulation  is  highly  accurate,  and  in  fact  proves 
that  (6.3) (c)  yields  the  exact  value  w  =  1  of  the  function  (6.2)  for  the  given 
values  of  x  and  y.  This  example  also  shows  that  the  way  in  which  arithmetic 
expressions  are  written  can  be  crucial  for  accuracy.  The  techniques  for 
evaluation  of  arithmetic  expressions  with  maximum  accuracy  [6]  can  be  automated, 
and  are  incorporated  in  <.  Pascal-SC  demonstration  program  [26] . 

In  Pascal-SC,  floating-point  intervals  are  declared  in  the  stereotyped  way: 

TYPE  INTERVAL  =  RECORD  INF, SUP:  REAL  END; 

As  in  the  case  of  complex  numbers,  the  operators,  functions,  and  procedures  in 
an  external  library  manipulate  intervals  declared  in  this  form.  In  order  to 
preserve  the  inclusion  property  of  interval  arithmetic,  all  rounding  in 
floating-point  INTERVAL  arithmetic  is  outward:  The  result  given  for  XOY  is  the 
smallest  floating-point  interval  Z  which  contains  the  actual  result.  This  can 
be  implemented  by  the  use  of  directed  rounding,  for  example,  addition  can  be 
performed  by  the  operator 

(6.6)  OPERATOR  +  (A,B:  INTERVAL)  RES:  INTERVAL; 

VAR  C:  INTERVAL; 

BEGIN 

C. INF  :=  A. INF  +<  B.INF; 

C.SUP  :=  A. SUP  +>  B.SUP; 

RES  :=  C; 

END; 

This  operator  is  available  as  a  pretranslated  subroutine,  declared  by 

OPERATOR  +  (A,B:  INTERVAL)  RES:  INTERVAL 

EXTERNAL  68; 

If  I  denotes  a  generic  variable  of  type  INTERVAL,  and  K  of  type  INTEGER, 
the  following  15  arithmetic  operations  are  provided  for  interval  arithmetic: 

±1,  K  ±  I,  I  ±  K,  I  ±  I, 

K  *  I,  I  *  K,  I  *  K, 

K  /  I,  I  /  X,  I  /  I. 

Because  REAL  floating-point  expressions  do  not  necessarily  yield  the  exact 
values  of  their  >eal  results,  operators  between  types  REAL  and  INTERVAL  are  not 
included,  since  the  resulting  interval  might  not  contain  the  true  outcome  of  a 
computation.  In  addition  to  interval  versions  of  standard  functions  and  various 
utility  procedures  [28] ,  the  interval  library  includes  some  operators  which  work 
with  intervals  as  sets  of  real  numbers.  There  are  the  "lattice  operators" 
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(6.7) 


** 


Intersection, 
Interval  Hull, 


and  the  relational  operators 

<=  Subinterval, 


(6.8) 


>= 

Superinterval , 

X 

Disjointness, 

IN 

Point  Inclusion, 

[28] .  The  intersection  operator  **  will  generate  an  error  interrupt  if  its 
operands  are  disjoint  intervals,  otherwise,  their  intersection  I  fl  J  will  be 
computed.  If  I  =  [a,b]  and  J  =  [c,d],  then  I  ++  J  =  [min{a,c},max{b,d}]  is  the 
smallest  interval  which  contains  both  I  and  J.  The  relation  I  <=  J  is  TRUE  if 
I  C  J,  otherwise  FALSE;  similarly,  I  >=  J  is  TRUE  if  I  D  J.  The  result  of 
I  ><  J  is  TRUE  if  I  and  J  are  disjoint  intervals.  This  test  can  be  used  to 
avoid  I  **  J  in  this  case.  If  R  is  a  floating-point  number  (type  REAL),  R  IN  I 
is  TRUE  if  R  €  I  as  a  real  number. 


With  regard  to  the  efficiency  of  the  implementation  of  interval  arithmetic 
in  Pascal-SC,  Bohlender  and  Grflner  [4]  give  the  following  typical  times  in 
milliseconds  for  a  microcomputer  with  a  2.5MHz  Z80  processor: 


Operation 

+ 

— 

* 

/ 

REAL 

2.2 

2.2 

6.0 

10.0 

INTERVAL 

5.4 

5.4 

23.0 

31.0 

Considering  the  fact  that  each  interval  operation  has  to  determine  two  real 
numbers,  and  that  interval  multiplication  and  division  require  the  calculation 
of  four  real  products  in  one  case  [14],  the  above  indicates  an  almost  optimal 
implementation  in  software.  By  contrast,  factors  of  100  or  200  between  REAL  and 
INTERVAL  arithmetic  speed  have  been  noted  on  conventional  computers ,  such  as  the 
UNIVAC  1100  series.  [18].  In  addition,  while  a  REAL  confutation  provides  only  a 
floating-point  number  which  approximates  the  result,  an  INTERVAL  computation  on 
the  other  hand  provides  an  interval  which  is  guaranteed  to  contain  the  true 
result.  An  inportant  application  of  this  property  of  interval  calculations  will 
be  described  below  in  connection  with  the  solution  of  linear  systems  of 
equations  with  guaranteed  error  bounds. 

7.  REAL  FLOATING-POINT'  VECTOR  AND  MATRIX  ARITHMETIC.  Calculations  with 
real  vectors  and  matrices  are  among  the  most  commonly  encountered  tasks  in 
scientific,  engineering,  and  statistical  confutation.  Pascal-SC  offers  the  user 
the  same  reliability,  accuracy,  controllability,  and  convenience  when 
calculating  with  real  n-dimensional  floating-point  vectors  and  matrices  as  it 
does  for  the  scalar  types  REAL,  COMPLEX,  and  INTERVAL.  This  is  accomplished 
with  the  aid  of  an  external  source  code  library  of  operators,  functions,  and 
procedures  for  vector  and  matrix  manipulation,  and  a  built-in  function  SCALP  for 
the  calculation  of  scalar  products  of  floating-point  vectors  to  BPA  accuracy  or 
with  directed  rounding  at  the  option  of  the  user. 


7.1.  Convenience.  In  order  to  illustrate  the  convenience  of  Pascal-SC  for 
vector  and  matrix  calculations,  suppose  that  A,  B  are  nxn  matrices,  and  x,  y,  z 
are  n-dimensional  vectors.  To  evaluate 

(7.1)  z  =  5.5ABx  +  3y, 
the  corresponding  expression  is  Pascal-SC  is 

(7.2)  z  :=  5.5*A*B*x  +  3*y; 

which  uses  ordinary  operator  notation  instead  of  the  function  and  procedure 
calls  which  would  be  required  in  ordinary  Pascal  and  most  other  language 3.  In 
order  to  make  use  of  the  software  provided  in  the  corresponding  external 
library,  a  stereotyped  declaration  of  floating-point  vector  and  matrix  data 
types  is  expected: 

CONST  DIM  =  #;  {The  actual  dimension  replaces  #} 

TYPE  DIMTYPE  =  1. .DIM; 

RVECTOR  =  ARRAY  [DIMTYPE]  OF  REAL} 

RMATRIX  =  ARRAY  [DIMTYPE]  OF  RVECTOR; 

The  operators  +,  -,  *,  and  the  operators  +<,  +>,  -<,  ->,  *<,  *>  with 
directed  rounding  are  available  for  various  permissible  combinations  of 
operands,  for  example ,  multiplication  of  an  RVECTOR  by  an  INTEGER  or  REAL,  and 
so  on  [28] . 


7.2.  Reliability,  accuracy,  and  controllability:  The  scalar  product 
SCALP.  The  general  theory  of  conqputer  arithmetic  [14]  requires  that  each 
component  of  the  result  of  a  vector  or  matrix  operation  be  rounded  to  the  BPA 
for  the  actual  real  result,  or  downward  or  upward  to  the  closest  neighboring 
floating-point  number  if  desired.  Addition  and  subtraction  of  vectors  and 
matrices  present  no  problems  from  the  standpoint  of  this  requirement,  since  the 
desired  results  can  be  calculated  componentwise  with  the  aid  of  the  six  REAL 
arithmetic  operators  ±,  ±<,  ±>  described  in  §3.  Calculation  of  the  scalar 
products  of  vectors,  which  is  an  inherent  component  of  matrix  and  matrix  by 
vector  multiplication,  is  a  different  matter.  Ordinarily,  this  calculation  is 
simulated  by  a  FOR  loop  of  real  operations,  such  as  in  the  following  function: 

(7.3)  FUNCTION  SPROD(A,B:  RVECTOR):  REAL; 

VAR  I:  DIMTYPE;  S:  REAL; 

BEGIN 

S  :=  0; 

FOR  I:=1  TO  DIM  DO 
S  :=  S  +  A  [I]  *  B[I] ; 

SPROD  :=  S 
-  END; 

This  is  an  example  of  what  Kulisch  calls  the  "vertical"  definition  of  computer 
arithmetic  [141,  [12].  Of  course,  there  is  no  hope  that  the  result  of  SPROD 
will  be  accurate  in  general.  For  this  reason,  the  internal  calculations  in  a 
function  of  thi3  kind  are  often  done  in  higher  precision  than  the  external 
calculation.  While  this  is  a  often  a  great  help  in  some  cases,  it  still  does 
not  solve  the  accuracy  problem.  On  the  other  hand,  the  Pascal-SC  function 


(7.4) 


SCALP(A,B:  RVECTOR;  ROUND:  INTEGER); 


[v 


which  is  a  built-in  feature  of  the  compiler/  will  calculate  the  value  of  the 
exact  scalar  product  of  A  and  B  to  an  adjacent  floating-point  number,  with 
rounding  downward,  upward,  or  to  the  BPA  controlled  by  the  value  of  the 
parameter  ROUND  [28].  This  reliability,  accuracy,  .and  controllability  is 
required  by  the  general  theory  of  computer  vector  and  matrix  arithmetic  [14]. 

It  can  be  achieved  by  special  algorithms  [14],  or  the  provision  of  a 
sufficiently  long  accumulator.  In  the  microcomputer  version  of  Pascal-SC,  this 
"long  accumulator"  is  implemented  in  software  [4],  but  the  same  thing  can  be 
done  in  hardware  [13],  and  can  be  expected  to  be  a  feature  of  future  advanced 
mainframe  computers. 

In  order  to  allow  the  accumulation  of  several  scalar  products,  uhe 
parameter  ROUND  can  also  inhibit  the  clearing  of  the  long  accumulator  before  the 
product  is  calculated  [28].  The  corresponding  values  are  given  in  the  following 
table: 


Rounding  Clear  Long  Accumulator  Inhibit  Clearing 

BPA  ROUND  =  0  ROUND  =  4 

Downward  ROUND  =  -1  ROUND  =  3 

Upward  ROUND  =  1  ROUND  =  5 

If  the  long  accumulator  is  not  being  cleared,  other  arithmetic  operatic  -Z  are 
not  permitted  between  successive  calls  of  SCALP  [28] . 

The  use  of  SCALP  makes  it  possible  to  calculate  the  results  jiatrix  and 
matrix  by  vector  multiplications  to  the  closest  floating-point  numbers,  or 
rounded  to  the  closest  larger  or  smaller  neighboring  values  if  desired.  This 
reliability,  accuracy,  and  controllability  distinguishes  Pascal-SC  vector  and 
matrix  arithmetic  from  traditional  packages. 

Some  of  the  important  properties  of  SCALP  are  illustrated  by  the  following 
examples. 


Accuracy 


(7.5) 


=  do". 


10”",  -10"), 


<1,  1,  D, 


the  value  of  SPROD(A,B)  is  0,  of  course,  while  SCALP(A,B,0)  gives  the  correct 
answer  10“".  Persons  who  believe  that  multiple  precision  will  solve  all 
accuracy  problems  should  determine  how  much  precision  is  required  on  their 
machine  for  SPR0D(A,3)  to  duplicate  this  result.  Once  satisfied,  they  can  then 
try  (7.6)  below.  Although  these  examples  are  extreme,  examples  can  be  given  for 
which  SPROD  is  highly  inaccurate  for  vectors  of  the  types  one  can  expect  to 
encounter  in  actual  problems. 

b.  Speed.  It  turns  out  that  the  accurate  scalar  product  function  SCALP  is 
faster  than  the  corresponding  FOR  loop  in  SPROD  if  DIN  >  1.  The  typical  times 
given  in  [4]  in  milliseconds  for  a  2.5MHz  Z80  processor  cure: 

SCALP  8  +  5. 5 (DIM), 

FOR  LOOP  1  +  9. 6 (DIM). 

Since  all  the  matrix  and  matrix  by  vector  multiplication  routines  are  based  on 
SCALP,  they  can  be  expected  to  execute  faster  than  their  inaccurate  simulations 
in  REAL  arithmetic.  Furthermore,  the  current  implementation  of  Pascal-SC 


handles  access  to  elements  of  arrays  very  efficiently  by  means  of  an  "array 
descriptor"  [11],  from  which  addresses  of  elements  can  be  calculated  quickly. 

c.  Deferred  overflow.  Under  ordinary  circumstances,  SCALP  will  not 
indicate  an  overflow  unless  the  actual  real  result  is  outside  the  range  of 
representable  floating-point  numbers.  For  example,  for 

(7.6)  A  =  (1099,  10"",  -10"),  B  =  (10",  1,  10"), 

£?CALP(A,B,  0 )  will  compute  the  correct  result  10“",  while  SPROD(A,B)  will 
overflow  for  1=1. 


d.  Ease  of  use.  The  function  SCALP  is  just  as  easy  to  use  as  SPROD,  and 
requires  no  additional  source  code  in  the  program,  since  it  is  part  of  the 
Pascal-SC  system.  Furthermore,  SCALP  is  more  versatile,  since  its  arguments  can 
be  arbitrary  one-dic .nsional  arrays  of  floating-point  numbers  of  the  same 
length,  of  which  RVECTOR  is  only  a  special  case.  When  called,  SCALP  consults 
the  array  descriptors  of  its  arguments  [11]  to  determine  if  they  are  in  fact  of 
equal  length,  and  then  calculates  their  scalar  product  if  this  is  true.  Thus, 
SCALP  can  be  used  to  calculate  scalar  products  of  vectors  of  various  dimensions 
in  the  same  program. 

8.  COMPLEX  AND  INTERVAL  FLOATING-POINT  VECTOR  AND  MATRIX  ARITHMETIC.  The 
basic  ideas  here  are  generally  the  same  as  for  real  vector  and  matrix 
arithmetic:  Convenience,  based  on  the  use  of  operator  notation  for  expressions, 
and  accuracy.  The  subroutines  in  MCLXB  and  MILIB,  respectively,  «'">ect  the  type 
declarations 


TYPE  CVECTOR  =  ARRAY  [DIMTYPE] 
CMATRIX  =  ARRAY  [DIMT1PE] 


OF  COMPLEX; 
OF  CVECTOR; 


and 


TYPE  IVECTOR  =  ARRAY  [DIMTYPE] 
IMATRIX  =  ARRAY  [DIMTYPE] 


OF  INTERVAL; 
OF  IVECTOR; 


in  the  corresponding  cases. 

For  accurate  scalar  products,  the  respective  functions 


FUNCTION  C SCALP  (VAR  A,3:  CVECTOR;  AKDIM:  INTEGER):  COMPLEX; 
EXTERNAL  188; 


and 


FUNCTION  ISCALP  (VAR  A,B:  IVECTOR;  AKDIM:  INTEGER):  INTERVAL; 

EXTERNAL  88; 

are  provided.  The  functions  form  the  basis  of  the  subroutines  for  accurate 
matrix  and  matrix  by  vector  multiplication.  The  products  are  computed  from  the 
first  AKDIM  components  of  each  vector  argument.  This  permits  the  flexibility  of 
using  vectors  of  various  dimensions  AKDIM  _<  DIM  in  the  same  program.  C SCALP 
computes  the  BPA  for  the  scalar  product  of  complex  vectors;  directed  rounding  is 
not  provided  for  the  complex  scalar  product  or  complex  matrix  and  matrix  by 
vector  multiplications.  ISCALP  computes  the  smallest  interval  which  contains 
the  exact  result,  as  in  the  case  of  other  interval  operations  [28] . 
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9.  SOLUTION  OF  LINEAR  SYSTEMS  OF  EQUATIONS  AND  MATRIX  INVERSION.  These 
are  problems  which  arise  time  after  time  in  scientific,  engineering,  and 
statistical  computation.  The  Pascal-SC  system  subroutines  for  these  purposes, 
which  use  the  accurate  scalar  product  and  interval  arithmetic,  yield  results 
which  are  far  more  exact  than  can  be  obtained  by  ordinary  floating-point 
arithmetic.  Furthermore,  guaranteed  error  bounds  are  given  for  results,  so  the 
reliability  of  the  computation  is  immediately  determinable.  These  subroutines 
will  accept  either  real  or  interval  vectors  and  matrices. 

The  basic  procedure  in  the  system  library  LGLLIB  for  the  solution  of  linear 
systems  of  equations  is  LGLP  (an  acronym  for  the  German  words  for  "linear 
equations  solution  program").  This  procedure  is  declared  by 

PROCEDURE  LGLP (DIM, AKDIM:  INTEGER;  VAR  A:  RMATRIX,  VAR  B:  RVECTOR; 

VAR  Y:  IVECTOR ) ? 

EXTERNAL  524; 

[28] .  The  purpose  of  this  procedure  is  to  solve  the  linear  system 
(9.1)  Ax  =  B 

with  coefficient  matrix  A  and  right-hand  side  B.  Instead  of  a  floating-point 
approximation  to  the  solution  x,  LGLP  calculates  an  interval  vector  Y  which,  if 
proper,  contains  the  exact  solution  x  of  (9.1),  and  proves  that  the  floating¬ 
point  matrix  A  is  a  nonsingular  real  matrix  [27].  This  allows  one  to  determine 
not  only  an  approximate  value  for  x,  but  also  guaranteed  error  bounas  for  it 
[23].  The  parameter  AKDIM  in  the  formal  parameter  list  allows  one  to  solve 
systems  of  size  smaller  than  DIM  if  desired;  only  the  first  AKDIM  rows  and 
columns  of  A  and  components  of  B  are  involved  in  the  calculation. 

Failure  of  LGLP  to  return  a  proper  interval  vector  Y  indicates  that  A  is 
singular  or  extremely  ill-conditioned.  In  this  case,  the  components  of  Y  will 
be  set  equal  to  the  improper  interval  [+1,-1].  A  test  should  be  made  for  this 
condition  immediately  on  return  from  LGLP,  since  all  interval  subroutines  expect 
proper  intervals  as  data  [28] . 

Thus,  LGLP  either  gives  a  solution  with  guaranteed  accuracy  or  an  error 
indication.  In  practice,  LGLP  has  been  observed  to  succeed  for  well-known 
examples  of  badly  conditioned  matrices,  such  as  Hilbert  matrices  [26],  [27].  In 
Appendix  B,  a  simple  program  to  solve  linear  equations  of  order  up  to  20  is 
given,  together  with  its  application  to  a  system  of  five  equations  in  five 
unknowns  in  which  the  coefficient  matrix  has  a  condition  number  larger  than 
4  x  10 18.  Inspection  of  the  resulting  interval  vector  Y  shows  that  LGLP  was 
able  to  solve  this  system  to  an  accuracy  of  one  unit  in  the  twelfth  significant 
digit.  The  same  5x5  system  defeated  a  standard  linear  equation  solver  on  a 
VAX  11/780. 

When  one  is  solving  severa’  systems  with  tne  same  matrix  but  different 
right  sides,  considerable  time  c^n  be  saved  if  the  approximate  LU-decomposition 
of  A  and  other  preliminary  calculations  are  only  done  once.  The  procedure  LGLPR 
is  available  for  this  purpose.  Its  declaration  is 

PROCEDURE  LGLPR  (DIM, AKDIM;  INTEGER;  VAR  A;  RMATRIX;  VAR  B;  RVECTOR; 

NRS:  BOOLEAN;  VAR  R:  RMATRIX;  VAR  MB;  IMATRIX; 

VAR  Yj  RVECTOR); 

EXTERNAL  522; 
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[28] .  The  first  call  of  this  procedure  is  with  NRS  =  FALSE.  Matrices  needed  to 
process  subsequent  right  sides  will  be  computed  and  stored  as  R  and  MB. 
Subsequently,  LGLPR  is  called  with  NRS  =  TRUE  for  each  new  right  side. 


Similar  procedures  LGLI  and  LGLIR  are  available  for  the  case  that  the 
components  of  the  coefficient  matrix  A  and  right  side  B  are  intervals,  that  is, 

A  is  of  type  IMATRIX  and  B  is  of  type  IVECTOR.  The  interval  vector  Y  in  this 
case  bounds  all  solutions  of  real  systems  with  coefficient  matrices  belonging  to 
A  and  right  sides  belonging  to  B.  This  can  be  helpful  in  case  where  the  data 
are  subject  to  uncertainty. 

For  matrix  inversion,  the  subroutine 

PROCEDURE  INVP  (DIM, AKDiM:  INTEGER;  VAR  A:  RMATRIX;  VAR  C:  IMATRIX); 

EXTERNAL  526; 

will,  if  successful,  compute  an  interval  matrix  C  which  contains  the  inverse  of 
the  real  (point)  matrix  A.  Singularity  or  extreme  ill-condition  of  A  ie 
reported  in  the  same  way  as  for  LGLP,  while  successful  calculation  of  C  proves 
that  A  is  nonsingular,  as  before.  Finally, 

PRODCEDURE  INVI  (DIM, AKDIM;  INTEGER;  VAR  A,C:  IMATRIX); 

EXTERNAL  527; 

is  used  for  inversion  of  interval-valued  matrices.  If  C  is  computed 
successfully  as  an  IMATRIX  of  proper  intervals,  then  C  contains  the  inverses  of 
all  real  macrices  contained  in  the  interval  matrix  A.  Return  of  C  with  all 
components  equal  to  the  improper  interval  [+1,-1]  indicates  that  A  contains  at 
least  one  singular  or  very  badly  conditioned  real  matrix. 

On  a  microcomputer  with  64Kb  of  storage,  LGLP  and  LGLPR  are  limited  to 
about  DIM  =  25  or  less,  LGLI,  LGLIR,  and  INVP  to  DIM  =  20,  and  INVI  to  DIM  = 

15.  For  these  relatively  small  systems,  the  time  required  for  execution  seems 
to  be  reasonable.  As  in  the  case  of  the  rest  of  the  Pascal-SC  system,  source 
code  for  declarations  and  pretranslated  code  for  the  above  procedures  can  be 
found  in  an  external  library. 

10.  EIGENVALUES  AND  EIGENVECTORS.  The  Pascal-SC  system  provides  the 
standard  subroutine 

PROCEDURE  EIGEN  (DIM, AKDIM;  INTEGER;  VAR  A;  RMATRIX;  LAMBDA:  REAL; 

VAR  X:  RVECTOR;  VAR  I  LAMBDA:  INTERVAL;  VAR  Y:  IVECTOR); 

EXTERNAL  534; 

for  the  calculation  of  guaranteed  interval  bounds  for  real  eigenvalues  and 
vectors  of  real  matrices  A,  in  particular,  symmetric  matrices.  This  is  another 
type  of  calculation  which  occurs  often  in  engineering  and  other  scientific 
computation.  In  addition  to  the  actual  dimension  AKDIM  and  the  matrix  A,  EIGEN 
expects  floating-point  approximations  LAMBDA  and  X  to  the  eigenvalue  and 
eigenvector  of  interest,  or  at  least  values  with  which  to  start  the 
calculation.  If  successful,  the  interval  value  ILAMBDA  and  interval  vector  Y 
returned  include  an  exact  real  eigenvalue  and  eigenvector  of  the  floating-point 
matrix  A,  and  furthermore  guarantee  that  the  included  eigenvalue  is  of 
multiplicity  one.  Hence,  EIGEN  will  not  succeed  for  multiple  eigenvalues  [si], 
[28].  In  case  of  failure,  EIGEN  will  return  improper  intervals  for  ILAMBDA  and 
the  components  of  Y. 
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1 1 >  THE  ACCURATE  SUM  OF  t  FLOATING-POINT  NUMBERS.  Statistical 
calculations,  in  particular,  often  require  the  computation  of  the  sum  of  n 
float!  ng-poixit  numbers  and  perhaps  also  their  squares, 

n  n  0 

(11. V  S  =  E  a  ,  T  =  E  a“  . 

i=1  1  i=1 

In  Pascal-SC,  it  is  possible  to  compute  the  EPA  for  S  and  T.-  or  round  the  result 
upward  or  downward  to  the  closest  floating  point  number  by  taking  the  a^  as 
components  of  an  RVECTOR  A  and  using  SCALP.  For  E  -  (1,1,1,...,1),  one  has 

(11.2)  S  =  SCALP (A, B, ROUND);  and  T  =  SCALP (A, A, ROUND ) ; 

with  t».‘-  desired  best-possible  result.  However,  in  the  case  of  the  sum  S,  the 
standard  ?a>»^al-SC  subroutine 

FUNCTION  SUM  (VAR  A:  RVECTOR j  AKLIM,  ROUND:  INTEGER):  REAL; 

EXTERNAL  480; 

is  also  provided.  This  function  performs  the  addition  of  the  first  AKDIM 
elements  of  A  to  the  BPA  or  result  of  directed  rounding  of  the  BPA.  SUM,  like 
SCALP,  uses  the  long  accumulator,  and  the  values  of  ROUND  have  tl'ia  same 
significance  as  given  in  §7  for  SCALP.  It  is  chus  possible  to  call  SUM  again 
without  clearing  the  long  accumulator,  to  allow  independent  accumulation  of 
partial  sums  without  loss  of  accuracy.  However,  no  other  arithmetic  operations 
are  allowed  between  successive  calls  to  SUH  [28] . 

When  computing  sums  of  interval  numbers,  one  can  use 

(11.3)  IS  :=  ISCALP(IA,IE,DIK); 

wl.ire  IE  has  components  all  equal  to  [1,1].  For  the  interval  sum  of  squares, 
however,  it  is  preferable  to  form  the  interval  vector  JQA  with  components  equal 
to  ISQR(A (I] )  for  I  =  1..DIM,  and  then  compute 

(11.4)  IT  :=  ISCALP(IQA,IE,DIM); 

rather  than  ISCALP(IA. Ia, DIM),  for  the  reason  given  in  §6  about  the 
preferability  of  ISOR(X)  to  X*X  for  intervals. 

12.  PROGRAMMING  IN  PASCAL-SC.  The  only  new  techniques  in  Pascal-SC  for  a 
Pascal  programmer  to  acquire  are  the  definition  and  use  of  operators.  Except 
for  these,  there  is  no  difference  between  Pascal  and  Pascal-SC  programming. 
Therefore,  the  discussion  here  will  focus  on  the  operator  concept  [19].  The 
definition  of  an  operator  subroutine  is  headed  by 

OPERATOR  <name>  (<r'ormal  parameter  list>)  <resuit  narae>:  <resulc  type>; 

The  code  following  this  heading  is  the  same  as  for  a  function  having  the  same 
purpose.  The  result  must  be  assigned  to  <result  i.ame>  in  its  entirety  before 
leavinq  the  subroutine.  For  example,  if  <result  name>  =  RES  is  of  type 
INTEV  ..-J,,  one  must  calculate  an  interval  U  and  make  the  assignment  RES  :*  U; 
before  leaving  the  subroutine,  rather  than  calculating  RES. INF  and  FF.S.SUP 
separately.  The  formal  parameter  list  consists  of  one  or  two  identifiers  and 
their  types.  Thus,  operators  are  either  unary  or  binary.  Since  operators  occur 


«ia*rjiOTax,’nin^.viArrx.‘jfvcivjr«Trw  cmr>bp>.ua  y-wp^-i 


m-t.  W-i.  jvk  - .r^m r. 


in  expression  strings  ("infix"  notation),  their  arguments  have  to  be  of 
expression  type.  That  is,  VAR  A,  etc.,  is  not  allowed  in  the  formal  parameter 
list.  The  examples  of  operators  given  in  §5  and  §6  and  below  can  be  used  as 
models . 

There  are  two  ways  to  name  an  operator  in  Pascal-SC: 

(i)  By  redefining  ("overloading")  one  of  the  standard  Pascal-SC  operator 
symbols  for  a  new  data  type  or  types. 

(ii)  By  use  of  an  arbitrary  name  selected  by  the  user  which  conforms  to 
the  ordinary  rules  for  identifiers  in  Pascal  [10].  In  this  case  (see  below), 
the  priority  of  the  operator  also  has  to  be  declared. 

These  two  methods  will  be  discussed  separately. 

12.1.  Overloading  standard  operator  symbols.  This  is  the  most  common 
method  used  in  scientific  and  engineering  computing  to  name  Pascal-SC  operators, 
since  one  usually  wishes  to  follow  the  ordinary  mathematical  notation 
encountered  in  the  formulas  being  used.  The  standard  operator  symbols  in 
Pascal-SC  are,  in  order  of  decreasing  priority: 

Unary  operators: 

NOT,  +  (unary),-  -  (unary) 

Multiplicative  (binary)  operators: 

*  /  DIV  MOD  ANO  **  *>  *<  />  /< 

Additive  (binary)  operators: 

+  -+>+<  ->  -<  ++  OR 

Relational  (binary)  operators: 

=  <><->=<>  IN  >< 

The  fundamental  distinction  between  a  unary" and  a  binary  OPERATOR  is  that 
the  formal  parameter  list  Cor  the  operator  contains  exactly  one  parameter  in  the 
first  case,  and  exactly  two  in  the  second,  and  these  are  the  only 
possibilities.  An  overloaded  operator  wili  have  the  same  priority  as  its  symbol 
in  the  table  above.  In  the  case  of  +  and  the  parameter  list  of  the  operator- 
heading  will  specify  whether  they  are  unary  (highest  priority)  or  binary. 

One  convenience  of  Pascal-SC  that  is  immediately  apparent  is  that  one  can 
define  **  to  perform  exponentiation  on  whatever  numerical  types  are  appropriate 
for  the  application  at  hand.  However,  this  should  be  done  with  care.  Some  good 
methods  are  given  in  [7],  Source  code  for  a  simple,  "repeated  squaring"  [22] 
implementation  of 

OPERATOR  **  (R:  REAL?  K:  INTEGER)  RES:  REAL; 

V* 

to  perform  R  for  integral  powers  of  floating-point  numbers  is  given  in  Appendix 

A.  This  operator  makes  it  possible  to  write  x^,  x4,  etc.  as  x**3,  x**4,  etc.  in 
expressions  to  be  evaluated,  which  is  a  more  convenient  way  to  represent  these 
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simple  powers  tha<:  by  a  procedure  or  function  call  as  in  ordinary  Pascal.  (A 
function  or  procedure  should  be  used  to  compute  the  result  of  raising  an 
interval  base  to  an  interval  power,  since  the  operator  **  is  vised  to  compute  the 
intersection  of  INTERVAL  variables  (see  (6.7)).) 

The  order  of  the  operands  in  the  formal  parameter  list  determines  the  order 
in  which  the  operator  will  ba  applied.  The  compiler  distinguishes  various  uses 
of  the  same  operator  symbol  by  the  type(s)of  its  operand(s),  and  their  order  if 
the  operator  is  binary.  Thus,  in  the  same  program,  "+"  can  be  used  to  denote 
addition  of  complex  numbers,  intervals,  vectors,  matrices,  quaternions, 
polynomials,  etc.,  in  addition  to  its  standard  meaning  for  integers  and 
floating-point  numbers.  All  that  is  required  is  that  the  appropriate  definition 
of  OPERATOR  +  be  given  in  the  heading  of  the  program  for  each  meaning  of  "+"  in 
the  body  of  the  program. 

Of  course,  the  compiler  recognizes  only  the  rules  of  arithmetic  for  user- 
defined  data  types  which  are  provided  to  it  by  the  programmer.  For  example,  if 
one  wishes  to  use  expressions  in  which  variables  of  both  type  INTEGER  and  type 
GRADIENT  f22]  appear,  both 

OPERATOR  +  (K:  INTEGER;  G;  GRADIENT)  RES:  GRADIENT; 

and 

OPERATOR  +  (G:  GRADIENT;  K:  INTEGER)  RES:  GRADIENT; 

must  be  defined  in  the  heading  of  the  program  so  that  the  compiler  czr.  produce 
code  for  both  K  +  G  and  G  +  K.  Type  GRADIENT  consists  of  the  value  of  a 
function  together  with  its  gradient  vector,  and  is  declared  by 

TPE  GRADIENT  =  RECORD  F:  REAL;  DF:  RVECTOR  END; 


in  Poi. .  V.1-SC  [22].  In  this  case,  both  operators  produce  the  same  result, 
consisting  of  the  alteration  of  the  function  value  G.F  of  the  GRADIENT  variable 
G  to  K  +  G.F  =  G.F  +  K,  respectively,  with  no  change  in  the  gradient  vector  G.DF 
[22].  However,  it  could  happen  that  the  user  is  working  with  quantities  for 
which  addition  is  not  necessarily  commutative.  Pascal-SC  allows  the  possibility 
of  defining  the  result  of  "+"  or  any  other  binary  operator  to  be  dependent  on 
the  order  of  the  operands. 


12.2.  Named  operators.  In  Pascal-SC,  the  user  can  name  operators 
according  to  the  ordinary  Pascal  rules  for  identifiers  [10].  For  example,  the 
factorial  operator  (a  unary  operator)  could  be  called  FAC.  In  this  case,  FAC  4 
would  have  the  value  4!  =  24.  Note  that  parentheses  are  not  used  unless  the 
operation  is  applied  to  an  expression.  Similarly,  a  named  binary  operator  is 


written  between  its  operands  ("infix"  notation)  in  the  same  way  §is  +,  -,  *,  /, 
etc.  The  operator  FAC  can  be  defined  in  the  program  heading  as  follows: 

OPERATOR  FAC  (A:  INTEGLR)  RES:  INTEGER; 

BEGIN 

IF  A  <=  1  THEN  RES  :-=  1 

ELSE  RES  :=  A  *  FAC  (A  -  1); 

END;  {  Recursive  definition  of  OPERATOR  FAC  } 

In  terms  of  FAC,  th6  binomial  coefficient  C(N,K)  could  be  computed  by  use  of  the 
statement 


(12.1) 


BINOM  :=  FAC  N  DIV  (FAC  K  *  FAC  (N  -  K)); 


This  example  is  for  only  for  illustration  of  the  construction  of  a  named 
operator  and  the  possibility  of  recursion.  On  the  microcomputer  implementation 
of  Pascal-SC,  INTEGER  arithmetic  is  implemented  only  for  integers  I  such  that 
-32768  <_  1  <_  32767  [28],  and  thus  FAC  N  can  be  computed  only  for  N  <_  7.  Actual 
computation  of  factorials  should  be  accomplished  by  a  type  conversion  to  REAL, 
and  controlled  by  WHILE  or  UNTIL,  in  order  to  avoid  stacking  recursions  too 
deeply . 

Similarly,  the  binary  Boolean  operator  XOR  for  "exclusive  or"  could  be. 
defined  by 

OPERATOR  XOR  (A,B:  BOOLEAN)  RES:  BOOLEAN; 

BEGIN 

RES  :=  (A  AND  NOT  B)  OR  (NOT  l  AND  B); 

END; 

A  typical  program  statement  using  XOR  would  be 

(12.2)  IF  OBS1  XOR  0BS2  THEN  PRC®  :=  0.25  ELSE  PROB  :=  0.75; 

which  would  assign  the  value  0.25  to  PROB  if  just  one  of  OBS1rOBS2  is  TRUE,  or 
0.75  otherwise. 


In  order  for  the  Pascal-SC  compiler  to  recognize  FAC  and  XOR  as  the  names 
of  operators,  and  assign  priorities  to  them,  a  PRIORITY  declaration  for  each 
named  operator  must  follow  directly  after  the  heading  line  of  the  program, 
which  gives  the  name  of  the  program  and  the  list  cf  files  used.  From  highest  to 
lowest  priority,  these  priority  declarations  have  the  forms 


PRIORITY  <Operator  name>  = 

PRIORITY  <0perator  name>  =  *; 

PRIORITY  <Operator  name>  r*  +; 

PRIORITY  <0perator  name>  =  =; 


{  Unary  operators  } 

{  Multiplicative  operators  } 
{  Additive  operators  } 

{  Relational  operators  } 


Thus,  suppose  one  writes  a  program  called  CHANCE  which  uses  the  operators 
FAC  and  XOR  to  calculate  probabilities  of  outcomes  in  some  stochastic  model,  and 
only  the  standard  files  INPUT  and  OUTPUT  are  used  for  communication  between  the 
program  and  the  outside  world.  If  XOR  is  to  have  the  same  priority  as  OR,  then 
the  first  three  lines  in  the  heading  of  the  source  code  for  the  program  would  be 


PROGRAM  CHANCE  ( INPUT , OUTPUT); 


PRIORITY  FAC  =  @; 

XOR  =  +; 

The  standard  sequence  of  definitions  and  declarations  would  then  follow  to 
complete  the  heading  of  the  program,  and  then  the  body  of  the  program  consisting 
of  the  actual  statements  to  be  executed.  The  structure  of  a  Pascal-SC  program 
therefore  differs  only  slightly  from  that  of  an  ordinary  Pascal  program,  as 
shown  in  the  next  section. 
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nonlinear  integral  equation  [25]/  the  solution  of  nonlinear  systems  of  equations 
by  iterative  methods  [22],  [9],  and  the  solution  of  ordinary  differential 
equations  by  real  and  interval  Taylor  series  [8].  In  these  applications  the 
operator  concept  of  Pascal-SC  was  used  to  implement  automatic  evaluation  of 
derivatives  and  Taylor  series  for  functions  defined  by  expressions  in  ordinary 
mathematical  notation  [8],  [22],  [24].  The  microcomputer  systems  used  in  these 
investigations  can  best  be  described  as  minimal:  Eight-bit  machines  with  Z80 
processors,  64Kb  of  main  storage,  and  two  disk  drives.  The  Pascal-SC  compiler 
used  was  developed  by  Profs.  >’J.  Kulisch  and  and  H.-W.  Wippermann  and  their 
associates  at  the  Universities  of  Karlsruhe  and  Kaiserslautern  in  Germany,  and 
is  described  in  [3],  [19],  and  [28].  Even  these  modest  resources  appear 
adequate  for  many  of  the  day-to-day  calculations  needed  by  engineers, 
scientists,  and  statisticians,  as  well  as  for  research  on  methods  in  numerical 
analysis  which  can  be  applied  to  larger  problems.  As  “personal  computers"  grow 
in  size  and  speed,  the  accuracy  and  convenience  of  Pascal-SC  will  provide  the 
user  with  a  more  powerful  tool,  and  its  features  will  also  be  advantageous  on 
forthcoming  larger  machines. 
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APPENDIX  A 


Evaluation  of  (6.2)  w  =  9x  - 


4  2 ' 

y  +  2y  in  Pascal-SC 


Source  code  for  the  program  WEVAL. 


PROGRAM  WEVAL ( INPOT , OUTPUT ) ; 

<*  This  program  calculates  w  ■  9*(x**4)  -  y**4  +  2*(y**2)  in  real  and 
interval  arithmetic.  *) 

IOSES  INTERVAL;  (*  DIRECTS  COMPILER  TO  USE  INTERVAL  LIBRARY  *) 

VAR  Cs  CHAR;  w,x,y:  REAL;  W,X,Y:  INTERVAL; 

(*  POWER  OPERATORS  *) 

OPERATOR  **  (R:  REAL;K:  INTEGER)  RES:  REAL;  (*  R  **  K  *) 


VAR  L:  INTEGER; U:  REAL; 

BEGIN  (*  OPERATOR  R  **  K  *) 

IF  (R  -  0)  AND  (K  <-  0)  THEN 
BEGIN  (*  ERROR  *) 

WRITELNC EXPONENTIATION  ERROR,  0  **  K,  K  <-  O'); 
SVR(O)  (*  RETURN  TO  OPERATING  SYSTEM  *) 

END;  (*  ERROR  *) 

IF  (K  -  0)  OR  (R  -  1)  THEN  U:»l 

ELSE  IF  K  -  1  THEN  U:«R 

ELSE  (*  K  <>  0,1  *) 

BEGIN  (*  REPEATED  SQUARING  *) 

L:-ABS(K);U:-1; 

REPEAT 

IF  L  MOD  2  -  1  THEN  U:-R*D; 

1**1.  DIV  2; 

IF  L  <>  0  THEN  R:»K*R 
UNTIL  L  -  0; 

IF  K  <  0  THEN  U:«l/U  (*  NEGATIVE  EXPONENT  *) 

(*  REPEATED  SQUARING  *) 

RES:«U 

END;  (*  OPERATOR  R  **  K  *) 
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OPERATOR  **  (It  INTERVAL}*:  INTEGER)  RESs  INTERVAL}  (*  I  **  K  *) 

VAR  L:  INTEGER}!?:  INTERVAL} 

BEGIN  (*  OPERATOR  I  *★  K  *) 

IP  (0  IN  I)  AND  \K  <»  0)  THEN 
BEGIN  (*  ERROR  *) 

WRITELNC  EXPONENTIATION  ERROR,  I  **  K,  0  IN  I*)} 
SVR(O)  (*  RETURN  TO  OPERATING  SYSTEM  *) 

END}  (*  ERROR  *) 

IF  K  »  0  THEN 

BEGIN  (★  K  «  0  *) 

U.INF:-1}0.SUP:-1 
END  (★  K  -  0  *) 

ELSE  IP  K  -  1  THEN  U:-I 

ELSE  (*  K  <>  0,1  *) 

BEGIN  (*  REPEATED  SQUARING  *) 


L:-ABS(K)}U.INFs-l}U.SUP:«l} 

REPEAT 

IP  L  MOD  2  -  1  THEN  U:-I*U} 

L:-L  DIV  2} 

IP  L  <>  0  THEN  I:*ISQR(I); 

UNTIL  L  «  0} 

IF  K  <  0  THEN  U:«l/U  (*  NEGATIVE  EXPONENT  *) 

END}  (*  REPEATED  SQUARING  *) 

RES:-U 

END}  (*  OPERATOR  R  **  K  *) 

(*  END  OF  POWER  OPERATORS  *) 

PROCEDURE  VVRITE(v:  REAL}  V:  INTERVAL)} 

BEGIN  (*  WRITE  RESULTS  OP  REAL  AND  INTERVAL  EXPRESSIONS  *) 

WRITELN} WRITELNC  v  -  ',«)}  (*  OUTPUT  OP  RESULTS  *) 

WRITELN} WRITELNC  W-  l '  ,W.INF , ' , '  ,W.SUP, '  1 ' )  } 

END}  (*  WRITE  RESULTS  *) 
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BEGIN  (*  MAIN  PROGRAM  *) 

C  :-  ’Y’;  WHILE  C  -  'Y'  DO 
BEGIN  (*  ACTUAL  CALCULATION  *) 

WRITELN(' ENTER  VALUES  OF  X  AND  Y* ) ;READ(x,y) ; 

X  INTPT(x);  Y  :«  INTPT(y) ;  (*  CONVERT  REAL. VALUES  TO  INTERVALS  *) 

v  :«  9*x*x*x*x  -  y*y*y*Y  +  2*y*y;  (*  REAL  RESULT  *) 

W  9*X*X*X*X  -  Y*Y*Y*Y  +  2*Y*Y ;  (*  INTERVAL  RESULT  *) 


WRITELN; 

WRITELN ( ' (a)  w  :-  9*x*x*x*x  -  y*y*y*y  +  2*y*y;  gives:'); 

WWRITE(w,W) ;  (*  OUTPUT  RESULTS  *> 

w  •«  9*(x**4)  -  y**4  +  2*(y**2);  (*  REAL  RESULT  *) 

W  :-  9*(X**4)  -Y**4  +  2*(Y**2);  (*  INTERVAL  RESULT  *) 

WRITELN; 

WRITELN('(b)  w  :«  9*(x**4)  -  y**4  +  2*0**2);  gives:'); 

WWRITE(w,W);  .  (*  OUTPUT  RESULTS  *) 

w  :-  (3*(x**2)  -  y**2)*(3*<x**2)  +  y**2)  +  2*(y**2);  (*  RFAL  RESULT  *) 

W  :«  (3*(X**2)  -  Y**2)*(3*(X**2)  +  Y**2)  +  2*(Y**2); 

(*  INTERVAL  RESULT  *) 


WRITELN; 

WRITELNC(c)  w  :-  (3*<x**2)  -  y**2)*(3*(x**2)  +  y**2)  +  2*(y**2);  gives:'); 
WWRITE(w.W);  (*  OUTPUT  RESULTS  *) 

WRITELN; 

WRITELN( 'MORE  VALUES  (Y/N)?');READ(C,C)  (*  CON*^'  .QUIT*) 

END  (*  ACTUAL  CALCULATION  *) 


END 


(*  MAIN  PROGRAM  *) 


2.  Sample  results  using  the  program  WEVAL. 


B>XQPC  WEVAL 

ENTER  VALUES  OF  X  AND  Y 

*10864  18817 

(a)  w  :»  9*x*x*x*x  -  y*y*y*y  +  2*y*y;  gives: 

H  -  1.58978000000E+05 

V  -  [ -1 . 84102200000E+06 ,  1 .l*897800000E+06] 

(b)  v  9*(x**4)  -  y**4  +  2*(y**2)j  gives: 
w  -  -8.41022000000E+05 

W  -  [-3.41022000000E+05,  1.15897800000E+06] 

(c)  w  :“  (3*(x**2)  -  y**2)*(3*(x**2)  +  y**2)  +  2*(y**2);  gives: 
v  «  1. 000000 OOOOOE+OO 

V  -  [  1.00000000000E+00,  1 .OOoOOOOOOOOE+OO) 

MORE  VALUES  (Y/N )? 

*N 

KL/P'-STOP 
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1.  Source  code  for  the  program  LINSYS. 

PROGRAM  LINSYS ( INPUT , OUTPUT ) ; 

(*  This  program  computes  an  interval  vector  Y  containing  the  solution 
of  linear  systems  of  equations  AX  *  B  up  to  order  20,  or  returns  an 
error  message  if  the  coefficient  matrix  A  is  singular  or  extremely 
badly  conditioned.  The  matrix  A  is  read  by  rows,  followed  by  B.  *) 


$USES  LGL,DIM~20; 
VAR 


(*  SETS  DIMENSION  FOR  EXTERNAL  LIBRARY  ROUTINES  A) 


A:  RMATRIX 
B:  RVECTOR 
Yj  IVECTOR 

AKDIM:  INTEGER; 
I, J:  DIMTYPE; 
C:  CHAR; 


{*  COEFFICIENT  MATRIX  *) 

<*  RIGHT  SIDE  *) 

(*  INCLUSION  OF  SOLUTION  OF  AX  -  B  *) 

(*  ACTUAL  SIZE  OF  SYSTEM  *) 

(*  INDE.”  VARIABLES  *) 

(*  CONTROL  VARIABLE  *) 


BEGIN  (*  MAIN  PROGRAM  *) 

C:='Y';  WHILE  C  -  'Y'  DO 

BEGIN  (*  SOLUTION  OF  SYSTEM  *) 

WRITELN( *  INPUT  DIMENSION' ) ; READ (AKDIM) ;WRITELN; 

WRITELNC  INPUT  MATRIX  BY  ROWS'); 

FOR  I:-l  TO  AKDIM  DO  FOR  J:«l  TO  AKDIM  DO  READ(A[I,J]) ; 

WRITELN ; WRITELN( ' INPUT  RIGHT  SIDE' ) ; 

FOR  Ij»1  TO  AKDIM  DO  READ(B[I]); 

LGLP(DIM,AKDIM,A,B,Y) ;  (*  SOLVE  SYSTEM  *) 

IF  YtU.INF  O  YllJ.SUP  THEN  (*  Y  IS  PROPER  *) 

BEGIN  (*  OUTPUT  OF  SYSTEM  AND  RESULTS  *) 

WRITELN; 

WRITELNC SOLVE  AY  -  B  WITH  INTERVAL  INCLUSION  OF  ANSWER'); 
WRITELN; 

FOR  J:«l  TO  AKDIM  DO  (★  OUTPUT  OF  A  BY  COLUMNS  *) 
BEGIN 

FOR  I:»J.  TO  AKDIM  DO 

WRITELNC A[ ' ,1:2,' ,' ,J:2,' ]  -  ' ,A[I,J]) ;WRITELN; 
END;  (★  OUTPUT  OF  A  *) 

WRITELN ; 

FOR  I:»l  TO  AKDIM  DO 

WRITELN('B[',I:2,'J  »  ',B[lJ);  (*  OUTPUT  B  *) 

WRITELN; FOR  Ii-1  TO  AKDIM  DO  (*  OUTPUT  Y  *) 

WRITELN( ' Yt ' ,1:2,' j  -  [ ' ,Y[I].INF, ' , ' ,Y[I] .SUP, ' ]') ; 


END 


(*  OUTPUT  OF  SYSTEM  AND  RESULTS  *) 
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ELSE  (*  Y  IS  IMPROPER  *) 

BEGIN  (*  ERROR  MESSAGE  *) 

VRITSLN ; WRITELN ( ' THE  MATRIX  IS  SINGULAR  OR  BADLY  CONDITIONED1); 
FOR  I:»l  TO  AKDIM  DO  Y[ll  <>SUP:»Y[I]  .INF  (*  RESET  Y  *) 

END;  (*  ERROR  MESSAGE  *) 

WRITELN; WRITELN ('ENTER  ANOTHER  SYSTEM  (Y/N)?:): 

READ(C,C); 

END  (*  SOLUTION  OF  SYSTEM  ★) 

END.  (*  MAIN  PROGRAM  *) 


2.  Sample  results  using  LINSYS 


A>XQP  LINSYS  TAK.DAT  CON: 
INPUT  DIMENSION 

INPUT  MATRIX  BY  ROWS 

INPUT  RIGHT  SIDE 


SOLVE  AY  «  B  WITH  INTERVAL  INCLUSION  OF  ANSWER 


St 


ai;  i,  i]  « 

A[  2,  ij  - 
A[  3,  1]  - 
A[  4,  1]  » 
A[  5,  1]  - 

A[  1,  2]  - 
Al  2,  2]  - 
A[  3,  2]  - 
A[  4,  2] 

A[  5,  2] 

A[  1,  3]  - 
A[  2,  3]  » 
A[  3,  3]  - 
A  [  4,  3  ]  “ 
A[  5,  3 J  - 

A[  1,  4]  ■ 
A[  2,  4]  - 
A[  3,  4]  = 
Al  4,  4 J  » 
A[  5,  4]  - 

Al  1,  5]  - 
Al  2,  5]  - 
Al  3,  5]  - 
Al  4,  5]  - 
Al  5,  51  - 


•1 * 91569421219E+11 
0.00000000000E+00 
8.54599767S33E+08 
O.OOOOOOOOOOOE+OO 
O.OOOOOOOOOOOE+OO 

O.OOOOOOOOOOOE+OO 
6.125803287135+11 
O.OOOOOOOOOOOE+OO 
4.67810493936E+07 
2 . 73290749688E+09 

l.OOOOOOOOOOOE+OO 
-l.OOOOOOOOOOOE+OO 
1 . 12080226545E+02 
O.OOOOOOOOOOOE+OO 
1.12075040461E+02 

O.OOOOOOOOOOOE+OO 
5 . 21250000000E-01 
O.OOOOOOOOOOOE+OO 
l.OOOOOOOOOOOE+OO 
O.OOOOCOOOOOOE+OO 

-5.03360090692E-03 
1 . 60975964126E-02 
2 . 24535604687E-05 
O.OOOOOOOOOOOE+OO 
7 . 1S104363616E-0 5 


Bl  1] 
2] 
3  J 
•  I 


1 . 85094646224E+04 
- 5 . 91833428873E+04 
-8.25723911920E+01 
1.92589687381E+33 
2.15844970057E+35 


ii  ’ ,  *  l  7.93624477937E+29,  7.9362447  938E+29J 
Yl  2]  »  i  7 . 93739034599E+29,  7.93739034600E+291 
Y{  3]  -  t-4.2000576C703E+32,-4.20005760702E+32j 
Yl  4]  «  1 -3 . 71300190864E+37 ,-3 . 71300190863E+37 ] 
Yl  5]  -  l-3.02038610401E+43,-3.02038610400E+43l 

ENTER  ANOTHER  SYSTEM  (Y/N)? 

KL/7-STCP 
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TC  StnSCTf  'M3WT-  .A.  XT* 


E^rrcsiMsr  ^s^aejetfjsjcsaLCi^'SJvsi*  jx  s*  iraTTFus^s 


3.  Contents  cf  the  data  file  TAK.DAT. 


A>TYPE  TAK.DA". 

5 

-1  *915694?,1219E+11  0  1  0  -5.03360090692E-03 
0  fc .  12580328713E+11  -1  5.2125E-01  1.60975.964126E-02 
8.54599767833E+08  0  l.-12C80226f'»5E+02  0  2.24535604687E-05 
0  4 . 67810493936E+07  010 

0  2 . 73290749688E+09  1.12075040461E+02  0  7 .18104363616E-Q5 
1.850946462242+04  -5.91833428873E+04  -8.25723911920E+01  1.92589687381E+33 
2 . 15844970057E+35 
N 
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OPTIMAL  CORRECTIONS  OF  A  DAMPED  LINEAR  OSCILLATOR 


UNDER  RANDOM  PERTURBATIONS* 

P.  L.  Chow  and  J.  L.  Menaldi 
Department  of  Mathematics,,  Wayne  State  University 
Detroit,  Michigan  48202 

ABSTRACT  An  additive  control  of  a  randomly  excited  linear  damped 
oscillator  is  studied  by  the  methods  of  dynamic  programming  and 
variational  inequalities.  The  objective  is  to  minimize  the  mean  deviation  from 
the  rest  position  over  a  finite  horizon,  with  a  possible  resource  constraint.  We 
shall  present  some  analytical  results  on  the  optircal  average  cost  function  and 
the  feedback  control  law.  Numerical  solution  will  be  discussed  briefly. 


I.  INTRODUCTION.  The  problem  of  controlling  random  vibrations  occurs  in  a  wide 
range  of  situations,  from  the  vehicle  dynamics  to  the  stabilization  of  a 
skyscraper  during  the  earthquake.  In  this  article  we  study  a  relatively  simple 
problem  involving  a  unit  mass  attached  to  a  spring  and  a  dash-pot.  If  the  system 
is  excited  by  an  external  random  force,  one  is  interested  in  regulating  the 
fluctuations  about  the  rest  position.  The  problem  would  have  been  classical  if 
the  controlling  force  were  smooth  and  unlimited  in  its  energy  supply.  However  we 
will  be  mainly  concerned  with  more  realistic  class  of  controls,  where  the  impul¬ 
sive  forces  are  allowed  and  the  total  energy  supply  is  fixed  over  a  finite  time- 
horizon.  This  makes  the  problem  more  meaningful  but  complicated.  But  it  is 
essential  to  take  the  necessary  first  step  in  order  to  understand  the  more 


This  work  has  been  supported  in  part  by  the  U.S.  Army  Research  Office  under 
Contract  DAAG29-83-K-00K  . 
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difficult  problems  with  several  degrees  of  freedom. 

In  a  recent  paper  [1],  we  treated  the  problem  of  controlling  a  first-order 
Ito  equation,  which  may  be  regarded  as  a  special  case  of  the  present  problem.  It 
corresponds  to  the  limiting  case  when  the  spring  constant  tends  to  zero.  Our 
work  is  a  generalization  of  the  results  by  Gorbunov  [2]  in  the  sense  that  we 
admit  a  wider  class  of  controls.  Furthermore  our  techniques  are  rather  different 
from  his.  For  the  general  mathematical  techniques  involved,  one  is  referred  to 
the  references  [3]  -  [5]. 

The  article  summarizes  some  preliminary  results  concerning  our  investiga¬ 
tion  in  the  announced  subject.  In  what  follows,  we  shall  first  formulate  the 
problem  under  study  and  describe  the  analytical  results  that  we  have  obtained. 
Also  the  numerical  solution,  among  others  will  be  discussed. 


II.  FORMULATION  OF  PROBLEMS.  Let  us  consider  a  damped  linear  oscillator  excited 
by  a  white-noise  and  controlled  by  a  regulating  force: 

••  •  2  • 

r  x  +  px  +  k  *  rw  +  v  ,  o  <  t<T, 

\  U> 

L  X(O)  »  Xq  ,  x(o)  »  , 

where  the  dot  means  the  time  derivative  and 


x(t)  :  the  position  of  a  unit  mass  at  the  time  t, 

q,  k  :  the  damping  and  spring  constants, 

x^,  yQ  :  the  initial  position  and  velocity, 

r  wt  :  the  white-noise  with  the  intensity  r  , 

v  :  the  momentum  control  at  t, 

T  :  the  horizon. 
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Since  the.  system  will  be  controlled  by  a  finite  source  of  energy »  we  introduce 
the  class  V  of  controls  consisting  of  all  random  processes 
{  v  ,  t  5to},  adapted  to  the  Wiener  process  {w  ,  tio},  with  a 
bounded  variation. 

To  give  the  equation  (1)  a  mathematical  meaning  we  interpret  it  in  the  Ito's 
sense : 

,  dx  =  ydt  , 

dy  =  -(px  +  qy)dt  +  rdwfc  +  dvfc,  (2) 

.  x(o)  =  x  ,  y(o)  =  y 

o  o. 


2 

where  y  *  i.  and  p  =  k  . 

Let  t  v  denote  the  positive  and  negative  parts  of  vt  »  so  that 

Vt  =  Vt+  ~  Vt”  *  C  *  °  '  <3> 

Suppose  the  total  supply  of  correctional  energy  is  fixed.  Then  we  have 
the  constraint 


+  — 

v  =  vt  +  vfc  a:  z,  tio  , 


(4) 


where  z  >  o  is  propotional  to  the  total  correctional  energy  supply  and 
will  be  called  simply  the  correctional  energy.  Let  V  ^  be  the  class  of 
admissible  controls  v  in  V  satisfying  (4). 
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For  each  control  policy  v  ,  let  J(v)  be  the  average  cost  defined  by 

J(x  >  yn‘  “  Ev{  JTf  Cx(t>»  1  dt  +  g  t*(T>.  y(D  ( 

o  o  v  *0 

where  f  and  g  are  some  positive  function neasuring  the  deviation  from 
the  rest  state. 


Our  goal  is  to  find  an  optinal  policy  v  in  V  in  the  form  of  the 

feedback  law,  which  minimizes  J,  i.e. 

J  *  J  (  v  )  **  inf  { J  (  v  )  :  v  e  V,}. 

ac 


To  this  end  we  shall  use  the  techniques  of  dynamic  programming  and 
variational  inequalities . 


III.  PENALIZATION  AND  DYNAMIC  PROGRAMMING  Instead  of  (2.2),  we  consider  the 


following  system 

•  dx(s)  «  y(s)  ds  , 

dy(s)  «[px(s)  +  qy(s)]  ds  +  rdw  +  dv  ,  ( 


dz(s)  -  -  d  j  v_l*  -(dv  +  +  dv  "),  o  <  s  *  (T-t), 


s  s 


dt(s)  *  ds. 


x(o)  «  x  ,  y(o)  *  y,  z(o)  -  z  t(o)  -  t. 


where  z(s)  is  the  remaining  amount  of  correctional  energy.  Corresponding  to 

the  cost  function  (5),  the  running  cost  from  t  to  T  is  given  by 

T-t 

J(x,  y»  i, t,v)  *E  J*  f  [x(s),  y(s),  y(s),  y(s)]  ds  +  8(x(T-t),  y(T-t)  ].  ^ 


The  optimal  cost  is  given  by 

h (x,  y,  z,  t)  -  inf  (J  (x,  y, z ,  t, v ):  v  e  V  ,  }  . 

aa 

In  order  to  apply  the  dynamic  programming  principle  to  find  u,  we  introduce 


a  regularized  version  of  Vad  defined  by 


v«d  "  <v  E  v.d ! *t  ’  v’  i',l*  i  >»  E  > 


(10) 


We  set 


u_(x»  y»  Z»  t)  -  inf  {J  (at,  y,  z,  t-,  v)  :  '  e  v  ,e}. 
*■  ad 


(11) 


Then  the  standard  dynamic  programming  argument  yields  the  optimality 

*  \ 

equation  for  u  : 

C 

^  inf  L  (v)  u£  =  -f(x,y),  x,  y  eS,  z  >o,  o<t  <T,  |v|  jg.l- 

“  g(x,  ?)  »  (12) 

■  u°(x,  y,  t). 

Here  we  have  put 


<  ue  (x,  y,  z,  t) 
u  (x,  y,  o,  t) 


'  L  (v)  *  L0  +  4l  (v) 

2  2 

,  L0  =  \  +  *5  r  oy  +  y  \  ~(px  +  qy)  dy, 
.  Aj  (v)  a  vdy  "  |v|  3Z  > 
and  u  satisfies 

Z.0u°  =  -f 
u°  (x,  y,  T)  a  g. 


(13) 


(14) 


In  (13)  the  partial  derivatives  at  ,  ^  .  stand  for  the  partial 

derivatives  d~.  .  cL  ,,,, 

Sf  Sx 

It  seems  reasonable  to  expect  that,  as  e  +  o,  the  solution  u  of  the 

e 

penalized  problem  (12)  approaches  the  optimal  cost  u  giver  by  (9), 
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'-A'. 


This  fact  among  other  results  will  be  described  in  the  next  section 


K 


/ 


IV.  SOME  RESULTS  AND  DISCUSSION.  First  let  us  present  some  preliminary  results 
pertaining  to  the  penalized  problem  (12)  and  its  relation  to  the  original  one. 
To  analyze  (12)  we  need  the  following: 

(R  1).  If  9Z  <P  *  °  ,  then 

in|  LAv)  cp  =  I  (  \\  cp\  +  3Z  cp  )+,  U|  *  i  »  (1 

1  e  3  6 

where  {x}+  denotes  the  positive  part  of  x. 

Under  appropriate  conditions,  we  can  show  that 
(R  2).  Assume  that  the  functions  f(x,  y)  *  f(x)^0  and  g(x,y)  *  g(x)  ^  ° 
are  convex  and  even,  and  the  functions  together  with  their 
derivatives  are  uniformaly  Lipschitz  -  continuous.  Then  the  optimal  cost 
function  ue  given  by  (11)  has  the  following  properties: 

(a)  For  each  t  >  0,  ue(x,  y,  z,  t)  is  convex  in  (x,  y,  z). 

(b)  u£(x,  y,  z,  t)  *  u£(-x,  -y,  z,  t)  for  any  (x,  y,  z,  t)  . 

(c)  u£is  Lip~continuou8  in  (x,  y,  z,  t),  (uniformly  in  e  ). 

(d)  '2>x  G£  and  3yue  are  uniformly  Lip-continuous  in  x,  y  and  e  for 
each  (z,  t). 

(e)  tt£  (x,  y,  . ,  t)  is  a  decreasing  function  of  z  for  every  fixed  (x,  y,  t). 

A 

Further  u£  is  the  solution  of  the  Hamilton-Jacobi-Bellman  equation  (12). 
Remarks :  The  fact  that  the  optimal  cost  u£  decreses  with  the  total  correction 

energy  z  i3  intuitively  clear.  It  follows  that  cp  *  u  satisfies  the  condition 

e 
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in  (Rl).  The  symmetry  property  (b)  enables  us  to  consider  the  problem  either  in 
the  region  x  3:  o  or  y  a  o. 

Now  we  hope  to  recover  the  optimal  cost  u  by  taking  the  limit  of  u  as 

e  +  o  •  This  is  indeed  a  valid  procedure*  In  fact  we  can  say  much  more,  as 
indicated  by  the  next  result: 

(R  3).  Under  the  same  assumptions  in  ("  ,  we  have 

<u  (x,  y,  z,  t)  =  lim  u  (x,  y,  z,  t) , 
e+o 

where  u  is  given  by  (9) .  Furthermore  u  preserves  the 
properties  (a),  (b),  (c),  (e)  in  (R  2),  and  satisfies  the  variational 
inequalities: 

*  (Au  -  f)  s  0  and  M  (u)  s  0  , 

(Au  -  f)  .  M(u)  *  0,  z  >  o, 

U1  t=T  =  g  * 

.  u  | 

z=o 

where  A  *  -  L- o  with  Lo  given  in  (13);  u°  satisfies  (1*+)* 
and 

M  (<P)  =  |  By  CP  |  +  d2  CP  . 


Remark:  In  the  above  analysis,  we  have  neglected  the  control  cost,  i.e.  f,  g 
independent  of  v.  A  similar  analysis  has  been  carried  out  when  the  cost 
for  control  is  not  negligible. 

(R  4)  In  the  case  of  one-sided  control  (  v  •  v+  ),  let 

w  (x,  y,  t)  »  lim  u(x,  y,  z,  t)  (18) 

Z  -*■  oo 

Then  the  problem  (16)  reduces  to 


Finally  we  have  the  following  decomposition, 

(R  5).  For  a  one-sided  control,  the  problem  (16)  can  be  decomposed  into 
the  control-free  problem  (14)  and  the  problem  (19)  with  unlimited  control 
energy.  In  particular,  the  solution  to  (16)  can  be  expressed  as  : 
u(x,  y,  z,  t)  -  w(x,  y,  t)  +w°(x,  y  +  z,  t) 
w°(x,  y,  t)  *u°(x,  y,  t)  -  w(x,  y,  t). 


(20) 


With  the  aid  of  the  above  decomposition,  the  construction  of  solution  to 
(16)  is  greatly  simplified. 

In  spite  of  t’s  above  results,  the  following  questions  remain  to  be 
resolved: 

(Q.l)  Free  Boundary  The  system  (16)  is  a  terminal  and  free-boundary 
value  problem.  For  the  reduced  problem  the  free  boundary  r  :  y  ■ 
y  (x,t),  separates  two  regions  in  which  Aw*  f,  Bw  «  o, 
respectively.  The  smoothness  and  the  shape  of  r  have  yet  to  be 
determined. 

(Q  2).  Construction  of  the  Optimal  Control.  In  the  one-dimensional 
case,  we  showed  that  the  optimal  control  is  a  reflected  diffusion  from  the 
free  boundary  [1].  It  is  conceivable  that,  in  the  present  case,  the 
optimal  control  is  an  oblique-reflected  diffusion.  But  this  has  to  be 
verified. 

(Q  3)  Numerical  Approximation.  To  compute  the  optimal  cost  u  ,  we  may 
try  to  solve  the  variational  inequalities  (16)  numerically.  This  can  be 
done  by  an  iterative  scheme  and  the  finite  element  method  [6].  Another 
possibility  is  to  apply  (R2)  and  (R3)  by  solving  the  penalized  problem  for 
small  e  .  The  penalized  problem  (12),  in  view  of  (RT),  is  a  nonlinear 
parabolic  terminal-boundary  value  problem  in  (t,  x,  y,  z): 


"-K3-C 


— .  m.  *  «w  -w— i--%  *•  •  J--1 


f 

l 

i 

i 


L°  “E  -  iflay  “J  +  v£  > + 


f,  o  *t<  T, 


ue(x,  y,  z,  T) 
u.(x,  y,  o,  t) 


g(x,  y)  ,  . 

u°  (x,  y,  t). 


(21) 


which  is  coupled  to  the  problem  (14)  'or  u°.  Because  of  the  high 
dimensionality,  an  efficient  numerical  algorithm  for  solving  such  problem 
is  unknown  to  us. 

The  above  questions  and  others  are  being  investigated,  and  further  results 
will  be  presented  elsewhere. 


j  t.t:  v  -■*  . 


-.'^^’rncsr'irt  wizau-?"  -  j»  vr*«wrji^r^  Ti -J»SEfliiwi»XR 
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NON-PERIODIC  CONDITIONS  FOR  CHAOS  AND 
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and 
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Introduction 

Li  and  Yorke  [1]  introduced  in  their  fundamental  paper  the  term 
"chaotic"  for  a  clcss  of  self-mappings  of  an  interval.  The  real  function 
f  is  chaotic  if  (a)  there  are  points  of  arbitrarily  large  periods  and  (b) 
there  is  an  uncountable  set  S  such  that  for  every  xQ,  Yg  G  S*  x0  ^  ^0’ 

Him  sup|fn(x  )  -  fn(y  )j  >  0  and  Aim  inf|fn(x~)  -  fn(yQ) |  =  0.  Following 

n-w  u  ir*»  u 

the  Li -Yorke  result  that  "period  three  implies  chaos",  many  authors  worked 
on  periodic  conditions  that  allow  the  same  conclusion.  The  best  known  of 
these  conditions  is  "period  p  t  2n  implies  chaos".  Such  investigations  are 
summarized  in  Targonski's  monograph  [2]. 

Li  and  Yorke  also  introduced  four-point  inequalities  satisfied  by  a 
point  and  its  three  successors  with  respect  to  the  given  function  f.  They 
showed  that  these  imply  the  existence  of  a  three-period  and  hence  chaos.  Our 
investigations,  begun  under  the  US  Army  Summer  Faculty  Research  and  Engineer¬ 
ing  Program  1983,  show  that  the  Li-Yorke  inequalities  play  a  fundamental 
role  in  the  theory  of  cnaos.  For  this  presentation  we  have  singled  out  three 
theorems.  The  first  is  an  addendum  to  the  Li-Yorke  theorem.  The  second 
establishes  eauivalent  companion  inequalities  to  the  Li-Yorke  inequalities. 

The  third  theorem,  of  a  different  character,  is  especially  important  in 
applications.  We  also  introduce  the  elementary  notion  of  a  Newton  function 
(our  term)  which  we  have  found  indispensable  in  the  investigation  of  parameter 

families  and  which,  we  think,  deserves  to  be  better  known.  The  quadratic 
2 

mapping  g(y)  =  ay  +  2by  +  c,  a  0,  b,  c,  real  constants,  particular  cases 
of  which  have  received  wide  attention,  is  used  to  illustrate  certain  points. 
Finally,  we  state'  an  open  problem  that  is  of  interest  in  connection  with 
"random  number  generators". 


159 


Definitions  and  Notations 


Let  f  :  R  -*■  R  be  continuous.  If  Xq  £  R,  the  orbit  of  Xq  under  f  is 
defined  as  the  set  {x  :  x  =  fnix0).  n  =  0,1,...},  where,  for  every  positive 
integei  n,  f°  is  the  n-th  iterate  of  f  and  ^(Xq)  =  Xq.  We  shall  write 
xn  :=  for  a  given  xQ  =  R  and  call-  Xj,x2. . .  the  successors  of  xQ. 

A  pre-orbit  of  a  given  Xq  £  R  is  any  (finite  or  infinite)  sequence 

Xq,  x  j,  x  2>***  such  that  f(x  n)  =  x  ^  for  all  n  for  which  x  n  is  defined. 

The  points  x  y  x  2,...  i®  any  such  sequence  are  called  predecessors  of  Xq. 

A  point  x0  is  called  critical  if  f(xQ)  =  Xq,  i.e.,  a  critical  point  of  f  is 
a  fixed  point  of  f.  A  periodic  point  Xq  of  period  p  >  1  (pa  positive  integer) 
is  a  point  for  which  the  relations  ^(Xq)  «  xQ,  f^(xQ)  Xq,  1  <  k  <  p,  hold. 

The  following  fundamental  results  are  now  well-known. 


Theorem .  (Sarkovskii) .  For  m,  n  =  0,1,...  consider  the  total  ordering  of 
the  positive  integers: 

3  <  5  <  7  <  •••  <  2*3  <  2*5  <  <  2n*3  <  2n*5  <  2n*7 

<  •••  <  2^  <  2^""^  <  •••  <  2^  <  2  <  1 

If  a  continuous  mapping  f  :  R  R  has  a  periodic  point  of  period  p,  <-uen 
it  also  has  a  periodic  point  of  period  q  for  every  q  *  •  (in  the  above  total 
o-  r) . 

Theorem  (Li -York e) :  Let  f  :  R  -*■  R  be  continuous.  If  there  is  a  point 
Xq  6  R  such  that  either  x^  <  xQ  <  x^  <  x2  or  x^  >  Xq  >  Xj  >  x2,  then  f  has 

a  point  of  period  three.  Furthermore,  if  f  has  a  three  periodic  point,  there 

exists  an  uncountable  set  SCR  such  that  for  every  Xq,  yQ  £  S,  Xq  f  y^. 


Aim  sup  x  -  y  >  0 

_  ri  n  'n' 

rr*» 


Aim  infix  -  y  1  =  0. 
n-  1  n  yn' 


_•  •  *  *  ■  •  *  *  v  *  • 


Definition.  A  point  Xq  2  R  satisfies  a  Li-Yorke  inequality  if 


x3  <  x0  <  X1  <  x2 


x3  >  x0  >  x,  >  Xj. 


An  Example.  The  following  example  shows  that  the  existence  of  a  three 

periodic  point  does  not  guarantee  the  existence  of  a  Li-Yorke  inequality. 

2 

The  quadratic  mapping  g(y)  *  ay  +  2by  ♦  c,  a  0,  b,  c.  real  constants,  may 

2  -1 

be  brought  in  the  form  f(x)  =  x  -  r  by  setting  y  =  a  (x  -  b) , 

-1  2 

g  =  a  (f  -  b),  and  r  =  b  -  b  -  ac.  f  has  a  three  periodic  orbit  for 
r  =  7/4,  bu':  no  point  xQ  €  R  satisfies  a  Li-Yorke  inequality  for  r  <  7/4. 

Definition.  Let  f  :  R  -►  R  be  continuers.  A  Newton  function  of  f  is  defined 
as  a  nonnegative  solution  of  the  differential  equation  N(x)  =  (x  -  f(x))N'(x). 


It  is  clear.*  from  the  definition  that  N  ie  unique  up  to  a  multiplicative 

constant,  piecewise  monotone  and  differentiable.  The  application  of  the 

traditional  method  of  Newton  for  solving  iteratively  the  equation  N(x)  =  0 

results  in  a  "Newton  sequence"  that  is  identical  with  the  iterates  =  f11  (xQ) 

Thus  by  determining  the  Newton  function  of  a  given  f  a  familiar  geometric 

picture  for  the  evolution  of  the  iterates  becomes  available  that,  in  our 

opinion,  is  superior  to  others.  Although  the  qualitative  and  quantitative 

characteristics  of  a  Newton  function  that  signal  existence  or  nonexistent 

of  chaos  await  exploration,  its  knowledge  certainly  settles  the  question  of 

convergence  at  a  glance  and,  in  any  case,  provides  an  invaluable  source  of 

visual  insight.  For  example,  V  considering  the  Newton  function  of 
2 

f(x)  =  x  -  r,  it  is  apparent  that  (a)  the  iterates  converge  to  c,  if 
x0  €  (c  2»c2^  ant*  <  r  <  3/4,  where  Cj  and  c2>  c^  <  c2,  are  the  critical 
points  of  f,  f  w  (b)  no  orbit  originating  in  the  interval  (-c2,c2)  can 
escape  it  fc"  2  "  r  >  -  1/4. 


Three  Theorems  in  the  Theory  of  Chaos 


Theorem  1 .  If  €  R  satisfies  a  Li-Yorke  inequality,  then  f  has  at  least 
two  distinct  three  periodic  orbits. 

The  theorem  extends  tha  L’ -Yorke  theorem  and  may  be  illustrated  by  the 
2 

example  f(x)  =  x  -  2.  The  point  x^  =  /2  satisfies  a  Li-Yorke  inequality  and 
(2  cos  2ir/7,  2  cos  4ir/7,  2  cos  8m/ 7) ,  (2  cos  2m/9,  2  cos  4m/9,  2  cos  8ir/9) 
are  two  distinct  three-periodic  orbits  of  f. 

Theorem  2.  The  sets  of  points  A  =  {x-  :  x,  <  x_  <  x.  <  x„} , 

B  =  {xrt  :  x.  <  x,  <  x_  <  x.},  ana  C  =  {x_  :  x„  <  x,  <  <  x,}  are  either 

all  empty  or  all  non-empty.  The  same  statement  holds  I.  t‘e  sets  with  all 
inequalities  reversed. 

The  checrem  extends  the  Li-Yorke  theorem  by  establishing  equivalent 
companion  inequalities  to  the  Li-Yorke  inequalities.  Since  examples  show 
that  other  four-point  inequalities  between  xQ,  x2,  x3  uo  not  assure  the 
existence  of  three-periodic  orbits,  the  theorem  lists  the  complete  set  of 
Li-Yorke  four -point  inequalities. 

For  the  proofs  of  Theorem  1  and  Theorem  2  the  reader  is  referred  to  [4]. 


Theorem  3.  If  a  critical  point  cQ  of  f  has  predecessors  satisfying 

c-2  •=  c0  <  c-3  <  c-l- 
then  f  has  a  six-periodic  point. 


Proof.  Since  f(c_2)  =  c  ^  >  c  7  and  f(Cg)  =  Cq  <  c  there  exists 

c_4  €  (c_2>  cq)»  and  f(c_3)  =  c_*>  <  c_4 >  fCc0)  c0  >  c_4  imply  the  existence 

of  a  c  5  €  (Cj,  c  ^) .  Hence 

c  -  <  c.  .  <  cn  <  c  .  :  c  ,  <  c  . . 

-2  -4  0  •  .  -3  -J 

If  we  set  c  5  “  Xq,  we  obtain  x^  <  x^  <  x7  <  x^,  i.e.,  Xq  satisfies  a 
Li-Yorke  inequality  for  the  function  f^.  Therefore,  f^  has  ?.  three-periodic 

k 

point  x  that  is  either  a  six-periodic,  or  a  three-periodic  point  of  f.  If 

★  k 

x  is  a  six-periodic  point  cf  f,  the  proof  is  complete.  If  x  is  a  three- 
periodic  point  of  f,  the  theorem  of  Sarkovskii  ensures  that  f  has  also  a 
six- periodic  point.  This  completes  the  proof. 


The  theorem  subsumes  many  propositions  that  ensure  chaos  via  the 

concept  of  snap-back  repeller,  first  introduced  by  Marotto  [5].  (Given  a 

critical  hjint  c^,  an  interval  (a^,  b^)  containing  cQ  is  called  a  repelling 

interval  if  for  every  €  (aQ,  bQ) ,  xQ  /  c^,  there  is  an  n  >  1  such  that 

x^  £  (a^,  b^).  A  critical  point  Cq  is  a  repeller  if  it  has  a  repelling 

interval.  A  repeller  c^  is  called  a  snap-back  repeller  if  ever/  interval 

(a^,  bQ)  containing  cQ  contains  a  point  x^  /  such  that  x^  =  cQ  for  some 

positive  integer  n  >  2.)  For  example,  by  adding  the  condition  that  the 

2 

interval  (c^,  c  does  not  contain  a  critical  point  of  f  ,  one  can  conclude 
that  Cq  is  a  snap-back  repeller.  We  intend  to  explore  the  connection  between 
our  inequality  condition  and  the  snap-back  repeller  condition  for  chaos 

elsewhere . 

2 

S:  .<n:plc.  If  f(x)  =  x  -  r,  then  f  has  a  six  periodic  point  for  all  r  >  r_, 

where  r^  «  1.543689011  is  the  real  root  of  r*5  -  2r  +  2r  -  2  =  0.  And  if 

we  apply  the  theorem  to  the  smaller  of  the  two-periodic  points  of  f,  which 

■> 

is  a  critical  point  of  f",  we  conclude  the  existence  of  a  twelve-periodic 

orbit  of  f  and  hence  chaos  for  all  r  >  1.4305.  This  is  the  best  prediction 

2 

to  date  that  chaos  has  occured  for  f(x)  =  x  -  r,  for  which  the  onset  ot 
chaos  at  r*  1.402  was  determined  by  computer  studies. 

2 

An  Open  Problem.  Consider  the  sequence  of  random  variables  xn+.  =  xn  -  2 

with  xQ  uniform  over  [-2,2].  fheri  xn  converges  in  distribution  to  the 

random  variable  x  whose  distribution  is  given  by  F(x)  =  1  -  (1/it)  arc  cos  x/2, 

2 

-2  <  x  <  2.  What  is  the  limit  in  distribution  of  x  .  =  x  -  p, 

n  1  1/9  i/2 

3/4  <  p  <  2,  where  xQ  is  uniform  over  [-  1/2  -  (p  +  1/4)*'  ,  1/2  +  (p  +  1/4)*'  ] 
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ABSTRACT  We  present  an  initial-value  technique,  based  on  the  use  of 
certain  compound  matrices,  for  the  numerical  solution  of  linear  two-point 
bounucry-value  problems  involving  unstable  ordinary  differential  equations  of 
the  singular  perturb?'.-  ion  type.  We  demonstrate  the  effectiveness  of  the 
method  via  certain  u — raples  which  exhibit  internal  as  well  as  end-point 
boundary-layers . 

I.  introduction  in  two  earlier  papers  [l,  2],  we  introduced  the  compound 
matrix  method  for  the  numerical  solution  of  mathematically  unstable  linear 
eigenvalue  and  boundary-value  problems  with  separated  boundary  conditions. 

The  focus  of  these  previous  studies  has  been  on  fourth-order  problems 
involving  single  differential  -!quations  of  the  Orr-Sommerfeld  type.  The  basic 
ideas  developed  in  [1,2]  have  since,  however,  been  generalized  in  a  variety  of 
ways,  and  these  include  the  use  of  compound  matrices  to  deal  with 
boundary-vaiue  problems  involving  higher-order  systems  of  equations  [3].  The 
method  has  also  been  successfully  applied  to  a  number  of  problems  arising  from 
the  study  of  hydrodynamic  stability  [4,  5,  6,  7].  In  this  paper,  we  wish  to 
demonstrate,  via  examples,  that  t-he  method  can  also  be  effective  in  dealing 
with  certain  second-order  problems  cf  the  singular  perturbation  type. 


II.  THE  COMPOUND  MATRIX  METHOD  For  our  purpose,  it  is  convenient  to 
first  discuss  the  compound  matrix  method  in  terms  of  a  t’ ird-order  problem. 
Consider  then  a  single  third-order  equation 

.1*3 (♦)  *  ♦*"  -  ca*m  -  c2^*  -  c3*  -  f,  (1] 


where  c^,  C2,  03,  and  f  are  functions  of  x  and  a  <  x  <  b.  We  let  jjj  =  [y,  »j>1 , 
iT 

;  we  shall  also  suppose  that  a  single  boundary  condition  is  prescribed  at 
x  =  a  and  is  given  by 


PiMa)  =  p,  (2) 

where  P  is  a  1  x  3  row  vector  and  p  is  a  constant.  The  remaining  two  boundary 
conditions  at  x  =  b  are  given  by 

Q*(b)  «  q,  (3) 

where  Q  ia  a  2  x  3  matrix  of  rank  2  and  q  is  a  2  x  1  constant  vector.  If  we 
let  i|>Q~denote  the  solution  of  {1}  which  satisfies  (2)  and  let  t|^  and  denote 

A/  aT 

two  linearly  independent  solutions  of  the  corresponding  homogeneous  problem 
(with  f  =  0  and  p  =  0) ,  the  solution  to  (1)  -  (3)  can  be  vjritten  in  the  form 

i>  =  +0  +  a^l  +  S'f’2»  (4) 

fw  /v  /v 

for  some  constants  a  and  8.  In  the  usual  application  of  the  method  of 
complementary  functions,  and  must  be  computed  as  solutions  to  three 

A#  M  M 

separate  initial-value  problems.  The  boundary  conditions  at  x  =  b  tnen  lead 
to  a  pair  of  linear  equations  from  which,  at  least  in  principle,  the  constants 
a  and  3  in  (4)  can  be  determined.  Nevertheless,  it  is  well-known  that,  the 
coefficient  matrix  of  the  linear  equation;;  for  a  and  8  can  be  highly 
ill-conditioned  if  the  homogeneous  solutions  of  (1)  exhibit  inherent  growth 
problems,  in  that  case  a  further  loss  of  accuracy  will  occur  due  to 
cancellation  errors  as  i}>  must  be  obtained  from  (4)  by  superposition. 

AM 

Alternatively,  the  compound  matrix  method  is  based  on  considering  the 
solution  matrices  ¥g  and  ¥  where 


We  note  that  the  2x2  minors  of  ¥  are  defined  by 


y  *  ,  y  y  ~  W  * 

1  12  12  2  1  2  12  3  12  12 


and  y  =  [yj_,  y?,  y3JT  is  called  the  second  compound  of  ¥.  The  only  3x3 

AM  ^ 

l.inor  of  ¥  is  simply  its  determin-  nt  which  can  be  expressed  in  the  form 


i  =  y  V  -  ■  *■  y  P  , 

1  0  2  0  3  0 


and  2  is  called  the  third  compo  ’nd  of  ¥q.  On  differentiating  (6)  and  (7)  and 
eliminating  the  third  derivatives  by  the  use  of  (1) ,  we  the*  obtain 


y’  =  y  , 
1  2 


y' 

2 


=  y  +cy  +  c  y  , 
3  12  2 


y'  =  c  y  -  c  y  , 
3  13  3  1 


(8) 


z’  =  c  z  +  y  f, 

1  1 

where  the  initial  conditions  for  y-j ,  y2,  y3  and  z  can  be  derived  from  the 
corresponding  conditions  on  $q,  ipf,  and  4>2  by  using  (6)  and  (7). 

Assuming  that  we  have  obtained  y1 ,  y2,  y3  and  z  by  integrating  (8)  from 
x  *  a  to  b  subject  to  the  appropriate  initial  conditions#  the  solution  tf>  can 
be  determined  in  the  following  manner.  We  note  that  (4)  can  be  rewritten 
as  a  homogeneous  system  of  three  linear  equations  of  the  form 


-  1>Q)  *2 

1 

(tj>-*0)’  1>1  i>2 

-a 

t  1  «  t  « 

1 - 

? 

1 

0 
' — ' 

— * 

-<• 

-0 

The  existence  of  a  non-trivial  solution  to  (9)  immediately  implies  that  $  must 
satisfy  the  "auxiliary"  equation 

yi/  -  y2*'  +  y3*  «  z  .  do) 

Hence  to  determine  4>,  we  integrate  (10)  backwards  from  x  *  b  to  a  subject  to 
the  prescribed  boundary  conditions  at  x  =  b.  Moreover,  it  can  easily  be  shown 
[2]  that  the  solution  ♦  thus  obtained  is  indeed  the  solution  of  the 
boundary-value  problem  ( 1 ) - ( 3 ) . 

The  foregoing  analysis  can  now  be  adapted  to  deal  with  boundary-value 
problems  involving  a  second-order  equation  of  the  form 

Lz(4>)  -  -  c^'  -  c2<f>  =  f  (11) 

together  with  separated  boundary  conditions  at  x  =  a  and  b.  If  we  now  let 
$  =  ♦ ' ,  then  Eq.  (11)  becomes 

♦  -  c^"  -  c2i>’  =  f  (12) 

and  cur  discussion  of  third-order  problems  is  then  directly  applicable  on 
setting  c2  =  0.  Moreover,  a  further  simplification  is  possible  if  we  assume, 
as  is  often  the  case,  that  the  boundary  condition  at  x  =  a  is  imposed  on 
either  $  or  In  that  case  we  have  y3(a)  =  0  and,  with  c3  =  0,  the  third  of 

£qs.  (8)  shows  that  y3(x)  =  Or  Thus,  it  is  sufficient  to  integrate  the 
remaining  three  of  Eqs.  (8)  from  x  *  a  to  b  to  determine  y1 ,  y2,  and  z.  The 
solution  $  can  then  be  obtained  by  integrating  the  first-order  equation. 

y-i$'  -  y2$  =  z  (13) 


from  x  *  b  to  a. 
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Ill,  EXAMPLES  AND  DISCUSSION  In  order  to  demonstrate  the  effectiveness 
of  the  procedure  just  described,  we  have  used  it  to  compute  the  solutions  for 


-1  <  x  <  1  of  the  equations  [s] 

e2(j,"  +  X<J>'  =  0,  (14) 

e2<}>"  +  (x  cos  x)  4> •  +  (x  +  x3)  $  =  0,  (15) 

eV  +  x$'  -  $  =  0,  (16) 

,  ,,  1 

s2$  +  x$'  -  -  4  =  0,  (17) 

2 

e2<J>"  +  |  x|  1  +  (x  -  -  )3*  =  o,  (18) 

and  e2i|> "  +  (x2  -  -  )$*  +  x<j>  =  0,  (19) 

2 

each  subject  to  the  boundary  conditions 

<M-1)  -  1  and  *(1)  =  2  .  (20) 


Our  results  for  e  =  10“2  are  shown  in  Figures  1  through  6  and  they  were  found 
to  be  in  good  agreement  with  those  of  Pearson  [s],  we  note  that  the  solutions 
of  (14)  and  (15)  exhibit  internal  boundary- layers,  while  those  of  (16)  and 
(17)  possess  corner  layers  near  x  =  0.  The  solutio:  of  (18)  and  C9)  display 
boundary-layer  behavior  in  the  interior  of  the  interval  as  well  as  at  the  left 
end-point. 

All  integrations  were  carried  out  in  double-precision  arithmetic  on  an 
IBM  3081  computer  using  a  fourth-order  Runge-Kutta  routine  with  a  constant 
stepsize  h  =  0.0005.  The  need  to  choose  such  a  small  value  of  h  is  the 
consequence  of  the  use  of  a  constant  step  integration  scheme  and  the  rapid 
variation  of  the  solutions  within  the  boundary-layer  regions.  We  believe, 
however,  this  limitation  can  in  part  be  overcome  with  the  use  wf  a  more 
sophisticated  variable  step  integration  scheme,  although  this  in  turn  would 
complicate  somewhat  the  implementation  of  the  method  from  a  programming 
standpoint.  It  is  of  interest  to  note  that  the  FORTRAN  program  used  to  obtain 
the  present  results  contains  fewer  than  120  lines  of  codes.  This  then 
suggests  that  the  compound  matrix  method  can  provide  a  simple  alternative  to 
other  initial-value  techniques  such  as  orthonormalization  at  least  for  a 
certain  class  of  problems  of  the  singular  perturbation  type. 

We  also  note  that  in  constrast  to  other  initial-value  techniques,  the 
compound  matrix  method  first  derives  a  lower-order  auxiliary  equation  whose 
solutions  automatically  satisfy  the  boundary  condition  at  one  of  the  two 
end-points.  The  solution  to  the  original  boundary-value  problem  is  then 
obtained  by  integrating  the  auxiliary  equation  subject  to  the  boundary 
condition  at  the  second  end-point.  This  has  the  effect  of  pinning  down  the 
solution  at  both  end-points.  In  this  respect,  the  compound  matrix  method  is 
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not  unlike  other  boundary-value  techniques  such  as  finite-differences  and 
indeed  this  is  the  chief  reason  for  its  effectiveness  in  dealing  with  unstable 
boundary-value  problems. 
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SELF  SIMILAR  SOLUTIONS  FOR  A  DEGENERATE  CAUCHY  PROBLEM 

Klaus  Holllg  and  John  A.  Nohel 
Mathematics  Research  .Center 
University  of  Wisconsin-Madison 
Madison,  WI  53705 


ABSTRACT.  Wa  determine  the  self-similar  solutions  of  the  Cauchy  problem 

Vt  =  *(v)xx'  x  e  R,  t  >  0  , 

(P) 

v(x,0)  =  g(x) 

for  the  constitutive  function  $(  v)  =  max(0,v)  and  the  model  datum 


(P+  >xY , 


x  >  0 


(  -(p_) | x I • ,  X  <  0 

where  Y »p±  >  0.  It  is  shorn  that  the  unique  solution  of  (P),  (D)  is 


v(x,t)  * 


-(p_) |x!Y,  X  <  -ic/t 

t^t(£-),  *  ,  -«/t 

/t 


where 


♦«)  =  +  b2(x)Dy(i|)]exp(-|r) 


P.kY+1Dy(-  ~]exp(|-J 


b  (tO  - - - - 7-r-s - 

i/2  exp^^—1-  ni) 


b2(x) 


PJ^Vv-il'  ~)expl|") 


i/2  exp(*i 


Y  +  1 


and  k  is  implicitly  determined  by  the  equation 

Y+1  2 

P  :=  P+/P-  =  l~)  ^dexP(|“)  ; 

D  (•)  denotes  the  parabolic  cylinder  function  of  index  v. 


v 


i  ^  *  n_*  n  «■  ■»  *  t  »  " 


rm» .w.ir.rcrv.ini ir*£7*  u*v  tra*jv  iW£rx~u 


Proposition.  For  the  model  datum  (D)  problem  (P)  has  the  unique  self¬ 
similar  solution 

v(x,t)  =■  J,  x  >  s(t),  t  >  0 

✓t 

where  • )  is  the  unique  solution  of  the  ordinary  differential  equation  (1) 
below,  satisfying  the  initial  conditions  (2)  and  condition  (3)  at  infinity. 
The  free  boundary  is s(t)  =  -KYt,  t  >  0.  For  p  :=  p+/p_  >  0  given,  <  >  0 
is  the  unique  solution  of  the  equation 


Y+1  2 

P  =  fci  *>•,(“  “JexP(|-i  '  (E) 

/  2  Y  1  /2  8 

where  D_y_  -j  (  * )  is  the  parabolic  cylinder  function  of  index  ( -y  ”  1 )  • 

In  the  forthcoming  paper  with  Vazquez  [HNV]  we  will  use  the  above 
Proposition  to  analyse  (P)  for  more  general  constitutive  functions  and  more 
general  data;  we  assume  there  that  <f>  is  smooth  on  [0,»)  with  0  < 
c  <  <  C,  $(v)  =  0  for  v  <  0  and  that  the  more  general  data  satisfy 

G(x)  =  g(x)  +  o(|x|Y)  ( j  x  |  0)  , 

where  g  is  the  model  datum.  Then  it  will  be  shown  that 


with  ic  defined  as  before.  For  <j>(v)  =  max(0,v)  and  y  =  1  we  proved  in 
[HN2,3]  the  stronger  result 

s(t)  =  -ic/t  +  0(tV2+°)  (t  V  0) 

for  any  o  <  1/2.  In  this  case  (E)  reduces  to 

2  3  2  00 

P  =  T~  +  T~  exp^T"^  /  exp(-y2/4)dy  . 


Note,  that  a  first  order  expansion  of  the  equation  g(s(t))s'(t)  =  vx(s(t),t) 
in  (P)  formally  yields  a  different  result,  namely  * 

(p_)s(t)s' (t)  =  (p+)  +  •••  , 


which  yields 


s(t  >  =  -  /2pt  +  ••■»  , 

whereas,  e.g.  for  p  =  1,  ic  =  .9034  ...  *  /2.  The  reason  for  this  apparent 
inconsistency  is  that  all  derivatives  of  v  become  singular  at 
(x,t)  =  (0,0);  in  particular  vx  is  not  continuous  at  this  point. 
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2.  PROOF  OF  THE  PROPOSITION.  Substituting  v(x,t)  -  tY/2\|i(x//t )  in  (5) 
one  sees  that  i|i  must  satisfy  the  linear  differential  equation 

2*Hm  +  -  Y*(£)  -  0  for  5  >  -K  ,  (1) 

subject  to  the  initial  conditions 


W-k)  ■  0,  r<-K>  =  (pj  ~~2 
and  k  is  related  to  p±  via 

lim  CY*<£)  -  P+  . 


The  free  boundary  is  given  by 


s(t)  *  -Wt  . 


Equation  (1)  can  be  solved  explicitly.  Put  x  =  £//2  and  w(£)  “  ♦(x). 
Then  ( 1 )  becomes 

w"(x)  +  xw’(x)  -  yw(x)  -  o  .  (4] 

Setting  w(x)  »:  y(x)exp(-x2/4)  we  obtain 

1  v2 

y"(x)  -  (•£  +  Y  +  f-)y(x>  -  o  .  (5] 

This  differential  equation  has  the  general  solution  [B,  p.  116-117] 

y(x)  «  b1D_^_1  (x)  +  bjjB^(ix)  (-»  <  x  <  •,  y  >  0)  , 

where  Dy(  *)  is  the  parabolic  cylinder  function  of  index  v.  Thus  the 

general  solution  of  ( 1 )  is 


*<«  “  [b^Vii”)  +  VY(i|)]exp^) 


for  -»  <  £  <  »  and  Y  >  0.  To  impose  the  initial  conditions  (2)  we  need  the 
formulae  (above  ref.  p.  119) 

It  > 

It  '  %  Vi^)cx£>lT~)  • 


/2  '-'VJ 


Then  the  initial  conditions  (2)  yield  the  pair  of  equations 


(7) 


ikr  jr+i  J- 

\  >  »  .  .V  M.  (  \  ^  _  /  ^ 


-Vy(-  ^  *  iTb2Vl1'  7=>  '  p- 


eXP[  g~"  J 


Because  (5)  is  of  self-adjoint  form  the  Wronskian  of  D (• ) ,  D  _^(»)  is 
constant. 

M/ft  /  «  \  ft  /  ft  %  A  —  _ 4  ... _ f  I*  ]f_  ^  1 


W(D  .,(*),D  (•)•)  =  -iexp[(2-^-J-)iri]  . 


<p  )kY  ^  -  ^jexpt— J 

r  /J  8 

b  {K)  =  - ___£_ - 

1  i/2  exp[(^jri] 


(P.XY  ^zjexP(3~J 

/77 _ r/T  +  1  i_,i 


b,(>c)  = - - 

,i/2  exp[(-~— IjiriJ 


and  (6)  with  b-j  ,b2  given  by  (8)  is  the  solution  of  (1)  satisfying  the 
initial  conditions  (2).  To  compute  the  limit  in  (3)  we  use  (see  above  ref.  p. 
122) 

2 

Dv(z)  «  z  exp(-  [1  +  0( |z|“2)]  as  |z|  +  *  ,  (9) 

which  is  valid  for  -  ~  <  arg  z  <  ~  .  Tfcus  for  £  e  R,  y  >  0, 

4  4 

2  ~y~1 

D_Y  ,[— )  =  exp{-  H  +  0(  lei"2)],  *  +  ^  ' 

Y  1  /I  8  /2 

Idy^I  =  ^r-K^lexp^ln  +  Oder2)],  e  -  ■*-  .  (io> 

Substitution  of  (10)  and  (8)  into  the  general  solution  (6)  yields 


;)  =  b.  (tO'exp^jfe  [1  v  0(1)),  ?  -*•  4®  . 


From  formula  (8)  we  see  that 


b2(k)exp(^jpj  =  kY+1D-y-1(-  ~)exp(|-J  .  (12 

Imposing  the  asymptotic  condition  (3)  and  using  (11),  (12)  we  finally  obtain 


-  *'  *» 

V*;V* 
-  ^  _,*m 

k\*A 

v%* 

i 

tt&'iiZr': 
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VARIATIONAL  PRINCIPLE  FOR  PENETRATOR  DYNAMICS 
USING  BILINEAR  FUNCTIONAL  AND  ADJOINT  FORMULATION 

C.  N.  Shen 

U.S.  Army  Armament,  Munitions,  and  Chemical  Command 
Arama^at  Research  and  Development  Center 
Large  Caliber  Weapon  Systems  Laboratory 
Benet  Weapons  Laboratory 
Watervliet,  NY  12189 


ABSTRACT .  The  solution  problems  in  both  spatial  and  time  domains  using 
■’  inite  element  method  can  be  based  on  the  variational  principle  employing 
; linear  functional  r.nd  adjoint  formulation.  This  principle  is  extended  to 
•  "\:pliof>  ays  toms  in  matrix  vector  form  such  as  penetration  dynamics.  The 
X  >  ■sent  hype,"  not  ;c  type  partial  differential  equation  of  interest  has  two 
c’  ■>  ndei< a*''!  two  'ndepend  int  variables  with  the  coupling  in  the  spatial 
domain. 

I.  IhiRGDUCTION.  The  elastic-plastic  stress-strain  relations  for  e  rod 
derived  by  T.  Wright  and  the  differential  equations  of  the  rod  itself  have 
been  cast  in*-o  variational  form.  The  variational  principle  using  bilinear 
functional  and  adjoint  formulation  has  served  as  bases  to  numerical  solutions 
by  the  finite  element  method.  This  principle  has  now  been  extended  to 
coupling  systems  3uch  as  in  the  impact  dynamics.  The  present  hyperbolic  type 
partial  differential  equation  has  two  dependent  and  two  independent  variables, 
with  coupling  in  the  spatial  domain.  This  new  formulation  is  also  ready  to  be 
used  for  the  coupled  impact  problems  which  is  given  in  the  appendix  of  this 
paper. 

II.  THE  VARIATIONAL  PRINCIPLE.  A  dynamical  system  can  be  modeled  by  the 
matrix  vector  partial  differential  equation. 


MO  y(0  -  “Q(0 


(1) 


with  appropriate  boundary  and  initial  conditions.  In  the  above  equation,  L  is 
a  matrix  linear  operator  in  both  spatial  and  temporal  domain,  y  is  a  vector 
dependent  variable,  Q  is  a  vector  forcing  function,  and  5  represents  all 
Independent  variables,  both  spatial  and  temporal. 

The  inner  produce  <  >  of  an  adjoint  forcing  function  Q  and  the  solution 
(y(O)  of  Eq.  (1)  can  be  used  for  the  purpose  of  estimation.  This  inner 
product  is  <Q,y>. 


*The  author  is  also  employed  by  Rensselaer  Polytechnic  Institute,  where 
he  holds  the  title  Professor  in  the  Electrical  Computer  and  Systems  Engerineering 
Department. 
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An  accurate  estimation  can  be  made  by  constructing  a  variational 
principle  [1].  By  using  the  adjoint  variable  y  as  a  Lagrange  multiply  for 
Eq.  (1)  adding  to  <Q,y>,  we  have 

Jlly.y]  =  <Q,y>  +  <y,(Q+Ly)>  =  <Q,y>  +  <y,Q>  +  <y,Ly>  (2) 

To  keep  the  system  symmetrical,  let  us  define  the  adjoint  system  as 

L(Oy(5)  -  -Q(5)  (3) 

By  using  the  original  variable  y  as  a  Lagrange  multiply  for  Eq.  (3)  adding  to 
<Q ,  y> ,  we  have 

J2[y,y]  =  <Q>y>  +  <ys(Cr''Ly)>  =>  <Q,y>  +  <y,Q>  +  <y,Ly>  (4) 

The  relationship  of  the  aiicint  system  to  the  original  system  is 

A  - 

d  *  <y,Ly>  -  <y,Ly>  *  De  (5) 

where  D  is  the  bilinear  concomitant  [1J.  Combining  Eqs.  (2),  (4),  and  (5)  one 
obtains 


J1  *  ^2  +  De 

In  order  to  keep  the  functional  symmetrical,  we  have 


Jl  -  \  Ul  +  H  +  Del 

which  is  of  the  form 

1  -  1  —  oe 

vTi  «  <Q,y>  +  <y,Q>  +  -  <y,Ly>  +  -  <y,Ly>  +  — 

Similarly, 

1  -  1  —  De 

J2  ■  <Q,y>  +  <y,Q>  +  -  <y»^y>  +  -  <y»^y>  -  ~ 


(6a) 


(6b) 

(6c) 

(6d) 


111.  INTEGRAL  OF  BILINEAR  EXPRESSION.  The  integral  of  a  bilinear 
expression  for  a  two-dimensional  problem  having  a  system  of  second  order 
partial  derivatives  in  time  and  in  space  can  be  written  as 

xb  tb  - 

I  *  J  /  n[y(x,t),y(x,t)]dtdx  (7) 

xo  bo 

where  ft[y,y]  is  a  given  bilinear  expression  in  the  form 

nly,y]  "  ytT&yt  ~  yxTAyx  -  yTpy  +  yxTry  +  yTNy*  (g) 

The  subscripts  t  and  x  indicate  the  partial  derivatives  for  the  functions  y 
and  y.  The  matrices  A,  B,  and  P  are  diagonal  and  T  and  N  are  off-diagonal. 
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(10) 


Equation  (8)  can  be  integrated  by  parts.  Two  different  forms  of 
integration  and  end  conditions  are  given.  The  first  form  of  the  integral  is 
obtained  by  integrating  by  parts  on  the  adjoint  variable. 


which  gives 


la  *  -  //{yTBytt  -  yTAyxx  +  yTpy  +  yTpyx  -  yTNyx>dtdx 
+  /  vTBytdx  -  /  yTAyxxdt  +  /  yTPydt 

1Am  If  ~  w( b 2i wt t-ai  2wxx+p 1 2wf Yiux-  mux)  d  tdx 
+  //  -  u(b2  2utt~a2 2uxx+p2 2u+Y2W2~n2Wx)dxdt 

xb  -  tb 

+  /  [wbx^t  +  ub2ut]  dx  + 

*0  t0 

fxb  «_  _  -  ,  *b  tb  -  *b 

J  [wai^Wx  +  ua2  ux]  dt  +  J  [wYlu  +  uY2»]  dt 
xo  *o  tg  *o 

,xb  -  -  tb 

Ia  “  ~<y,Ly>  +  J  [bi^/Wt  +  b24uut]  dx 


(ID 


(12) 


tb  xb  tb  -  -  xb 

-  J  Ial  wwx  +  a24uux]  dt  +  J  [Yiwu  +  Y2«w]  dt 

tr 


(13) 


to  ^o  to  Xo 

On- the  other  hand,  we  can  perform  integration  on  the  original  variable  to  give 
ib  ■  -//{  yTBytt  ■  yTAyxx  +  yTpy  -  yTrTyx  +  yTNTyx)dtdx 


/yTBytdx  -  /yTAyxdt  +  /yTNTydt 


which  gives 


m 


xb  ■  J7(“w)  (bl  ^tt^ai^xx+pi  Y2ux+T12ux)dtdx 

+  II  ("u)(bi2utt-a22uXx+P22u“^w3C+niwx)dtdx 

,xb  ,  t  7~  tb  ,tb  o~  xb  tb  -  xb 

+  J  [wb]_<:wt+ub2<:ut3  dx  -  J  [wai^wx+uazux]  dt  +  J  [wn2u  +  uhiw]  dt  (15) 
*0  ^0  to  Xq  bo  Xq 

Xb  -  -  tb  tb  -  -  Xb 

lb  *  **<y.Ly>  +  J  [bi^-vft  +  b2Zuut]  dx  -  J  [a].2**^  +  a22uux]  dt  + 
xD  t0  t0  Xq 

tb  -  -  Xb 

+  J  [n2wu  +  niuw]  dt  (16) 


To  keep  the  £orm  symmetrical,  we  take  the  average  of  the  above  two 
expressioaa 

1  1  *b  tb  l  -  — 

1  -  -  Ia  4-  -  Ib  -  -/  /  -  (yLy+yLy)dtdx  +  . 

2  2  Xq  tQ  2 


+  J(yTByt+yTByt) |  dx  -  -  J(yTAyx+yTAyx) |  dt  +  -  /(yTPy+yTNTy)dt  (17) 

b0  2  Xq  2 

which  gives 


1  -  1  ,  “  1  xb  -  -  -  -  tb 

I  *  “  -  <y»**y>  “  “  <y»hy>  +  -  J  {b1i:(wwt+wwt)  +  b2*(uut+uut)]  dx 

2  2  2  Xq  tQ 

1  tb  -  -  -  Xb 

-  -  !  Iai2(wwx+wwx)  +  b22(uux4tiux)]  dt 

2  tQ  Xq 

1  tb  -  -  Xb 

+  -  J  [(Y]+ni)wu  +  (Y2+h2)uw]  dt 


where 


“11  T“l2 

l21  b22 


I'll  M.2 
L21  L22 


1*11  ■  I'll 


+  PI2  -  ai2  75  »  l22  “  l22  -  b22  --j  +  p22  -  a22  —5  (20) 


L12  *  l12  “  -  ~  *  1*21  *  t21 
9x 


IV.  THE  SYMMETRICAL  ADJOINT  SYSTEM.  The  adjoint  independent  variable  r 
in  Figure  1  can  be  expressed  as 


Tb_~  -  tp 

Tb  To  tb  ~  fco 

(28) 

which  gives 

and 

t  -  tb  for  t  *  t0 

(29a) 

t  =  x0  for  t  =»  tb 

(29b) 

It  is  noted 

from  Eq.  (28)  that 

Tb  “  To  “  Tb  "  to 

(30a) 

T  -  Tb  +  t0  -  t 

(30b) 

dr  =*  -dt 

(30c) 

d  d 

dr  dt 

(30d) 

and 

y(x,t)  «  y(x,r  -  rb+t0-t) 

(30e) 

Let  us  assume  that  the  adjoint  system  shown  in  Figure  1  gives 

A  A 

y(x,t-t)  -  y(x,t*tb+t0-t) 

A  A 

(31a) 

yt(x,t-t)  »  -yt(x,t«tb+t0-t) 

A  A 

(31b) 

A 

yx(x,t-t)  -  yx(x,t»tb-ft0~t) 

(31c) 

where  t  is  a 

dummy  variable  for  t. 

We  may  define  the  adjoint  system  as  the  image  reflection  in  the  time 
domain  of  the  original  system.  Equation  (31)  yields  the  following  known 
initial  conditions 


A  A 

y(x,t«tb)  *  y(x,t»t0) 

A  A 

(known) 

(32a) 

yt(x,t-tb)  -  -yt(x,t-t0) 

(known) 

(32b) 

The  interpretation  of  the  above  equations  gives  the  initial  conditions  of  the 
original  system  as  the  far  end  conditions  for  the  adjoint  system,  since  the 
adjoint  system  is  a  reflected  mirror  of  the  original  system  in  time. 


V.  INITIAL  CONDITIONS  FOR  THE  ADJOINT  SYSTEM.  We  take  a  symmetry 
approach  for  the  initial  conditions  of  the  adjoint  system  as 


y(x,t»tb)  »  y(x,t-t0) 

t  yt(x,t-tb) 

=  -yt(x,t»t0) 

(33) 

y(x,t=t0)  =  y(x,t-tb) 

,  yt(x,t-t0) 

■  -yt(x»t-tb) 

(34) 

where  y  and  y  are  given  in  Eq.  (10). 

Thus  Eq.  (25) 

becomes 

D1  ■  /  bi2dx{[w(x,t-to)wt(x,t«tb)  +  w(x,t-tb)wt(x,t»t0)] 

xo 


-  [w(x,t=tb)wt(x,t*t0)  +  w(x,t=*t0)wt(x,t-tb)]  } 

xb 

+  /  t>22dx{u(x,t=t0)ut(x,t-tb)  +  u(x,t=tb)ut<x,t»tc>] 

x0 


-  [u(xtt=tb)ut(x,t»t0)  +  u(x,t-t0)ut(x,t*tb)J j  *  0  (35) 

Since  the  integrand  of  Eq.  (35)  is  zero,  the  above  satisfies  Eq.  (25). 

VI.  THE  GENERALIZED  BOUNDARY  CONDITIONS.  Let  us  consider  the  operator  L 
in  Eqs.  (19)  through  (21).  It  is  assumed  that  elastic  springs  are  installed 
at  the.  ends  such  that 


yx(xb.t)  “  Kby<xb>t)  »  yx(xb»t)  -  Kby(xb,t) 
yX(xo.t)  -  “Koy(xo,t)  ,  yx(x0,t)  -  -K0y(x0,t) 


(36a) 

(36b) 


where  Kb,  Kq  are  diagonal  matrices.  If  Eq.  (36)  is  substituted  into  Eq.  (26), 
we  have 


DZ  -  0 

Since  Dl  *  DZ  *  0,  Eq.  (24)  becomes  D3  as  given  in  Eq.  (27) 

De  “  c3 


(37) 


(38) 


VII.  CONDITIONS  FOR  THE  COUPLING  TERMS.  The  sum  of  the  functionals  Ji  + 
I  is  obtained  by  adding  Eqs.  (6c)  and  (18)  as 


where 


xb  tb  -  -  1 

Jl  +  I  +  J  J  (Qy+yQ)dxdt  +  T  +  B  +  V  +  -D3 
x0  tg  2 


1  xb  -  -  -  -  tb 

T  -  J  [bi  (wtvfwtw)  +  b22(utu+utu)]  dx 

*°  tQ 

1  rfcb  .  -  -  ,  -  -  xb 

B  "  “  “  J  l«l  Ovrtwxv)  +  a2Z(uxu+uxu)]  dt 

2  t0  Xq 


(39) 


(40) 


(41) 


1  tb  -  -  Xb 

V  *  -  J  [(Yi+ni)uw  +  (Y2tT2)wul  dt 


1  tb  -  -  Xb 

D3  “  “  J  I(Yi-ni>uw+  (Y2-n2)wu)  dt 


From  the  last  two  equations,  one  obtains 


1  tb  -  xb 

V  +  -  D3  *  /  (YluwfY2wu)J  dt 

2  t0  Xq 


We  can  let  Eq.  (44)  vanish  by  choosing 


Yl  -  Y2  *  0 


which  gives 


V  +  -  D3  »  0 
2  J 


From  Eq.  (22)  one  obtains 


*>1  -  1  +  n  “  1 


(46a) 


n2  «  1  +  Y2  -  -1 


(46b) 


Thus  the  functional  for  the  original  variables  and  adjoint  variations  becomes 


Jl  -  -1  +  <Q,y>  +  <Q,y>  +  T  +  B 


i'he  sum  of  the  two  functionals  J2  +  I  is  obtained  by  adding  Eqs.  (6d)  ard  (18) 


xb  tb  -  _  1 

j2  +  I  -  J  J  (Qy+yQ)dxdt  +  T  +  B  +  V-  -D3 
x0  t0  -2 


where  X,  B,  V,  and  (1/2)03  are  given  in  Eqs.  (40)  through  (43).  By 
subtracting  (l/2)D3  from  V  we  have 

1  tb  -  -  Xb 

V - D3  *  /  ( hiuw+h2wu)  1  dt 

2  t0  Xo 


In  this  case  va  let 


hi  *  h2  *  0 


Then  from  Eq.  (22)  one  obtains 


Yi  *  -1  +  hi  «  “1 


Y2  “  1  +  h2  *  1 


C  '<ro"OrPfv'r  ■  v; 


Thus  the  functional  for  the  adjoint  variables  and  original  variations  becomes 

J2  »  -I  +  <Q,y>  +  <Q,y>  +  T .+  W  (53] 

which  gives  the  same  form  as  shown  in  Eq.  (47). 

VIII.  THE  FIRST  VARIATION.  By  taking  the  variations  6y  and  <$y 
separately,  we  let 

SJ  *  6j(6y)  +  6J2( 6y)  -  0  +  0  (54] 

Then  one  obtains  from  Eqs.  (40),  (41),  and  (47)  that 

SJl(6y)  *  -6l(6y)  +  //  Q6y  dxdt  +  6T(6y)  +  6B(6y)  “  0  (55] 


where 


-  l  xb  -  Sb 

6T(6y)  «  -  /  (bj_2(wt6wfwdwt)  +  b22(ut6u+u6ut.)]  dx 

2  Xq  tQ 

1  tb  -  -  -  -  xb 

6B(6y)  *  -  -  /  [aiz(wx4wfw6wJt.)  +  a2z(ux6u+u6ux)]  dt 


and  -6l(6y)  can  be  derived  from  Eq.  (18)  with  Yl  *  Y2  *  0  and 

X|j  t  |j  M  • 

-6l(6y)  »  -/  /  {(bi^j-dwt  -  pi2w5w  -  ai2wxSwx]  + 

*o  to 

[b22utfiut  “  p22u$u  -  a22ux5ux]  +  [ux6wwx6u]  Jdtdx 
The  second  term  on  the  right  side  of  Eq.  (54)'  is 

4J2(<$y)  “  -6l(Sy)  +  //  Q5y  dxdt  +  6T(6y)  +  6B(6y)  »  0 


where 


1  *b  ,  -  -  tb 

6T(6y)  ■  -  J  [bi^(vSwi+v^6w)  +  t^^uSu^-Hit 6u)]  dx 

2  *o  tQ 

1  tb  -  .  -  *b 

6B(6y)  «  -  /  taiz(w6wx+wx6w)  +  a2z(u6u*Hix6u)]  dt 


and  6l(6y)  can  be  derived  from  Eq.  (18)  with  Tty  *  Tfc  ■  0  and  -Yl  *  Y2 


*b  tb  -  -  - 

-6l(6y)  ■  -  J  J  {[bi"Wt^wt  -  pi'tjSw  -  ai^jjdwjj]  + 
xo  t0 

[b22ut4ut  “  P22u6u  “  a22ux5ux]  +  [-wx6u  +  ux6w] Jdtdx 


(62) 


The  adjoint  equation  has  the  same  form  of  the  original  equation  by  dropping 
and  adding  the  bars  simultaneously  on  every  variable. 

Equations  (55)  through  (58)  are  the  key  equations  to  be  used  for  the 
finite  element  method.  It  is  noted  that  the  first  variation  $Ji(6y)  is  the 
same  as  the  first  variation  SJ2( 6y)  by  adding  or  dropping  the  bar  on  top  of 
the  variables  and  their  variations.  We  do  not  need  to  solve  for  the  adjoint 
system  in  Eqs.  (39)  through  (41)  since  they  give  exactly  the  same  solutions  as 
that  of  the  original  syscem. 

IX.  CONCLUSIONS.  The  functional  in  bilinear  matrix  vector  form  is 
symmetrical  about  the  original  variables  and  the  adjoint  variables.  The  Euler 
Lagrange  equations  for  the  coupling  systems  are  derived  using  the  fundamental 
lemma  of  the  calculus  of  variations.  By  integrating  the  bilinear  matrix 
vector  expression  by  parts,  one  can  obtain  the  bilinear  concomitant  in  terms 
of  initial  and  boundary  terms.  The  adjoint  system  can  be  arranged  in  a  manner 
that  it  is  a  reflected  mirror  of  the  original  system  in  time.  Thus  the 
initial  conditions  for  the  bilinear  concomitant  become  zero.  Generalized 
boundary  conditions  using  many  types  of  "springs"  relating  the  various  spatial 
partial  derivatives  can  be  defined  to  satisfy  the  boundaries  of  the 
concomitant.  Algorithms  are  developed  for  use  in  the  finite  element  method  by 
taking  the  first  variations  of  the  functional.  There  algorithms  are 
simplified  because  the  adjoint  system  gives  exactly  the  same  solutions  as  that 
of  the  original  system. 
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APPENDIX 


The  wave  equation  In  rods  due  to  T.  W.  Wright  is  given  as  the  following 
system 


2X 

*55  + 


C1 


“5  WTT 


(A-la) 


X+y  X  c2 

U55  -  [6 - u  +  4  -  W£j  -  — 5  uTT 

p  y  C2Z 


which  can  be  transformed  to 


X+2  y  Xf2  y 

('n  Wtt  ‘  17  u%%)  "  Ux 


y  cz  y  2(i+y) 

l  w::  +  (r:  ~2  «tt  -  77  uxx  +  —7 —  «) 

4X  c2  4X  X 


With  appropriate  group  of  parameters,  we  have  the  following  form 

biVt  -  ai^xx  +  pi2*  ~  ux  »  0 

wx  +  b22utt  -  a22uxx  +  P22u  «  0 


{ 


(A-lb) 


(A-2a) 

(A-Zb) 

(X-3a) 

(A-3b) 


The  above  system  of  equations  can  be  expressed  by  a  matrix  vector  form  of 
equations  as 


L11 

M.2 

L21 

L22 

0 

0 


(A-4) 


where 


a2 


32 


L*1  "  ^  S?  “  *!2  S?  +  P!‘  *  Ll2“--£ 


(A-5) 


3  32  32 

L21  =“  —  and  L22  -  b22  ~2  -  a2Z  1-5  +  P2^ 
3x  3t  ox* 


(A-6) 


which  can  be  written  as  Eq.  (1)  in  the  text. 


m 


m 


fM 

m 

m 

M 


The  notations  for  the  wave  equation  in  rods  are: 
w  *  axial  displacement 
u  ■  radial  strain 

K  »  z/a  *  nondimens ional  axial  coordinates 
t  »  ct/a  »  nondimens ional  time 
a  »  elastic  stored  energy  per  unit  length 


ci  *  / (A+2n)/P  •*  longitudinal  wave  speed 
C2  *  / y/p  *  shear  wave  speed 
X  and  u  are  Lame^  constants 

The  above  system  of  equations  is  first  derived  in  different  form  by 
Mindlln  and  Herrmann  and  can  be  grouped  into  a  single  equation  as: 


32 


;2  32 


a2 


XHi  Cb^  32 

l(i?  ‘  "  I?  ~ s  5  ( 


2  2 


U  Cl2  3£2  Cb2  q>2 


)(w  or  u) 


(A-7) 
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REPRESENTATION  OF  TWO-PHASE  FLOWS  BY  AVERAGING 
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Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  MD  21005 


ABSTRACT 

We  consider  descriptors  of  gas-particle  aggregates  which  represent  space 
averaged  local  properties  of  the  aggregate.  We  show  that  such  descriptors 
have  undulations  due  to  the  finite  size  of  the  averaging  volume,  and  derive  an 
estimate  for  the  bounds  of  the  amplitude  of  the  undulations.  According  to 
that  estimate,  one  obtains  reasonably  accurate  averages  only  if  the  averaging 
volume  contains  at  least  30-150  particles.  In  terms  of  the  size  of  the 
averaging  volume  this  limitation  means  that  the  diameter  of  the  volume  should 
equal  at  least  four  to  five  mean  distances  between  particle  centers.  A 
consequence  for  the  modeling  of  two-phase  flows  through  tubes  is  that  space 
averaged  descriptors  cannot  resolve  the  radial  structure  of  suich  flows  unless 
the  mean  particle  distance  is  much  smaller  than  one  tenth  of  the  tube 
diameter.  This  condition  excludes,  for  instance,  the  use  of  average 
descriptors  for  radial  flow  resolution  of  some  interior  ballistics  flows, 

where  the  particulate  phase  consists  of  propellant  grains.  We  also  derive  an 
approximate  characterization  of  the  boundary  of  a  particle  aggregate  for 
different  sizes  of  the  averaging  volume. 

1.  INTRODUCTION 

Properties  of  gas-particle  mixtures  commonly  ere  characterized  in  terms 
of  descriptors  which  represent  space  averages  over  a  finite  volume.  For 
instance,  one  way  define  the  rarticle  temperature  at  any  point  of  the  mixture 
as  the  average  temperature  of  all  particles  within  a  finite  neighborhood  of 
the  point,  and  use  a  similar  definition  for  average  gas  properties  by 

averaging  the  local  gas  properties  over  a  volume  representing  the 

neighborhood.  In  order  to  obtain  reasonable  averages,  the  averaging  volume 
should  be  large,  for  Instance,  larger  than  the  particles.  On  the  other  hand, 
by  averaging  over  a  large  volume,  one  loses  information  about  local  flow 
structures,  for  instance,  the  boundaries  of  particle  aggregates  are  washed 
out.  Therefore,  one  wants  the  averaging  volume  to  be  small.  One  may  find  a 
reasonable  compromise  between  the  contradicting  requirements  if  the 
requirements  are  quantitated.  Such  a  quantitation  is  the  purpose  of  the 
present  paper. 

We  approach  the  problem  by  investigating  the  undulations  of  the  gas 
volume  fraction  a  in  the  averaging  volume  in  terms  of  the  distribution  cf  the 
particles,  and  of  the  size  and  location  of  the  averaging  volume.  The 

Investigation  produces  a  bound  for  the  undulations.  By  specifying  a  tolerance 
level  for  the  undulations,  one  can  use  this  bound  to  obtain  a  minimum  size  of 
the  averaging  volume  which  would  insure  that  undulations  of  a  are  below  the 
tolerance  level.  Undulations  of  other  flow  descriptors  are  shown  to  be 
proportional  to  the  undulations  of  a. 
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The  smoothing  effect  of  volume  averaging  on  flow  structures  is 
illustrated  by  a  representation  of  a  particle  aggregate  boundary  by  the  value 
of  a.  In  concept,  such  a  boundary  consists  of  .  a  narrow  transition  zone 

between  regions  with  a=l  (gas  only)  and  o=5  (the  average  gas  volume  fraction 
In  the  mixture  region).  One  finds  that  the  width  of  the  transition  zone, 
which  equals  about  one  diameter  of  the  averaging  volume  is  not  narrow  in 
comparison  to  a  mean  distance  between  particles.  The  transition  profile  can 
be  represented  by  an  approximate  formula. 

In  Section  2,  we  present  and  discuss  the  principal  results  of  the 

investigation,  i.e.,  the  estimated  bound  of  undulations  of  a.  Discussions  of 
some  properties  of  the  gas  volume  fraction  o  are  presented  in  Section  3,  which 

also  contains  an  outline  of  the  derivation  of  the  bound.  The  representation 

of  a  particle  aggregate  boundary  in  terms  of  a  is  discussed  in  Section  4. 
Section  5  contains  a  concluding  discussion  of  the  results. 

2.  PRINCIPAL  RESUiJS 


Let  the  averaging  volume  be  a  sphere  with  radius  R  and  let  the  particles 
be  spheres  with  radius  s  arranged  in  a  three-dimensional  lattice.  The  gas 
volume  fraction  a  in  the  averaging  sphere  is  generally  different  for  different 
positions  of  the  sphere.  However,  the  limit  value  of  o  as  R  becomes  infinite 
Is  independent  of  the  location  of  the  center  of  the  averaging  sphere,  only 
depending  on  the  lattice  and  on  the  particle  radius  s.  Let  the  limit  value  be 
5.  We  define  a  mean  distance  Lffl  between  particle  centers  by  the  equation 

lm  -  2s(l-5f1/3.  (2.1) 

A  motivation  for  this  definition  is  given  in  Appendix  B,  where  it  is  also 
shown  that  in  general,  Lm  is  10-24  percent  larger  than  the  smallest  distance 
between  particle  centers,  i.e.,  between  the  lattice  points  of  the  particle 
arrangement. 

The  difference 

Ao  *  o  -  a  ,  (2.2) 


between  the  gas  volume  fraction  in  the  averaging  sphere  and  the  limit  value  of 
o  depends  or.  the  size  and  position  of  the  sphere  as  well  as  on  5  and  the 
particle  arrangement.  In  Section  3,  we  show  by  sample  calculations  that  the 
magnitude  of  AS  is  bounded  by 

M  <  0.5(l-5)SZ(Lm/R)2.  (2.3) 


independently  of  the  position  of  the  averaging  sphere.  The  bound  Eq.  (2.3) 
has  been  tested  by  calculations  for  four  different  lattices  (defined  in 
Appendix  A)  and  for  1^R/Lms4  and  0.5<a<0.9.  Because  the  formula  has  the 
correct  limit  A5=0  for  5*1,  it  can  be  used  as  an  estimate  for  all  520.5. 
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Extrapolations  to  smaller  values  of  a  should  be  done  with  caution  and  the  same 
applies  to  extrapolations  to  R/Lm>4.  The  domain  R/Lm<l  is  of  little  practical 
interest,  because  there  the  undulations  are  too  large. 

Let  I Aaltol  be  an  acceptable  tolerance  level  for  the  undulations.  Then 
one  can  reformulate  £q.  (2.3)  as  a  condition  for  R/Lm  obtaining 


R/Lm  >*5[0.5(l-«)/|A5|tcl]  . 


(2.4) 


V  R/Lm  satisfies  Eq.  (2.4),  then  the  undulations  of  o  are  less  than  1 A5I td- 

If  5  is  close  to  one,  i.e.,  if  most  of  the  volume  is  occupied  by  gas, 
then  it  is  reasonable  to  choose  a  tolerance  level  that  is  proportional  to  T-5, 
instead  of  a  constant  level.  For  instance,  the  tolerance  level  could  be 


l“lt0l  for  5  i  V 

[(l“o)/(l-at)]|flo)t0l  for  5  Jr  5^.. 


(2.5) 


(We  assume  that  5t  and  |A5lto1  are  such  that  |Ao|t0i/{l-ot)<l.)  The  condition 
Eq.  (2.4)  with  these  tolerance  levels  can  be  expressed  in  the  form 


R/Lm  »  a[max(l-a,  l-5t)]1/2[0.5/|4a|tol]1/2 


(2.6) 


We  restrict  for ^ simplicity  St  to  5^*0. 85  and  observe  that  for  this  range  the 
factor  of  jAa|’o^  in  Eq.  (2.6)  has  a  maximum  at  5*2/3,  a  minimum  at  5=5^,  and 
is  linearly  increasing  between  5-5tand5*i.  Hence  if  5  is  not  known,  then 
Eq.  (2.6)  should  be  used  with  5*2/3,  and  If,  for  instance,  5  is  known  to  be 
larger  than  5t,  then  Eq.  (2.6)  should  be  used  with  5*1. 

Figure  1  is  a  graphical  display  of  Eq,  (2.6)  for  5t*0.9.  It  shows,  for 
instance,  that  for  |A5jt0^  *0.01  the  averaging  sphere  radius  should  be  larger 
than  2.7L  If  a  is  not  known9  and  larger  than  2.2Lm  if  5  is  known  to  be  larger 
than  0.867.  The  solid  lines  in  the  figure  indicate  the  domain  in  which  sample 
calculations  have  been  made.  Extrapolations  are  Indicated  by  dashed  lines. 

The  relation  Eq.  (2.3)  also  can  be  expressed  in  terms  of  the  number  N  of 
particles  In  the  averaging  volume.  We  arrive  at  such  ar.  expression  by  using 
the  approximation  (see  Appendix  B) 


N  %  (2R/LJ3 

Ml  • 


(2.7) 
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Figure  1.  Averaging  Sphere  Radius  for  Given  Tolerance  and 
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Substituting  this  expression  in  tq.  (2.3)  we  obtain 


|<£|  <  2.0(1-S)a2  N'2/3, 


(2.8) 


and  the  tolerance  condition  (2.6)  becomes 

N  >  a3[max(1-o,  l-ot)]3/^2  C 2/ 1 A5 } tQ ^  1?^ -  (2.9) 


The  tolerance  condition  (2.9)  is  displayed  in  Figure  2  for  at=0.9.  It  shows 
that  for  |A5|tol  =0.01  the  minimum  number  of  particles  in  the  averaging 

volume  is  between  65  and  160,  depending  on  5*  The  condition  has  been  tested 
by  calculations  with  N  between  3  and  512.  Extrapolation  beyond  the  tested 
range  is  indicated  in  figure  2  by  dashed  lines. 

The  undulations  of  averages  of  flow  descriptors  other  than  a,  such  as  gas 
density,  are  closely  related  to  the  undulations  of  a.  We  investigate  this 
relation  in  Appendix  C  and  show  by  examples  that  the  amplitude  a$  of  particle 
induced  undulations  of  a  descriptor ♦  is  proportional  to  Ac*.  If  ^ has  a  constant 
gradient  and  changes  by  6<j>  along  a  distance  equal  to  the  diameter  of  the 
averaging  sphere,  then 

[Aa'fe  j6$|  | Aot | / ( 3a ) -  (2.101 


If  6$=0  but  there  are  local  particle  induced  inhomogeneities,  than  the 
undulations  of  ♦  again  are  proportional  to  Aa  and  to  the  amplitude  of  the 
local  inhomogeneities.  If  d$  is  the  change  of  the  constant  average  value  of 
♦  due  to  such  disturbances  then 


|d*J  •  |Aaj/(a{l-a)). 


(2.11) 


Using  the  relations  (2.10)  and  (2.11)  one  can  obtain  tolerance  conditions  for 
R/Lm  and  N  in  terms  of  a  tolerance  level  for  any  flow  descriptor  ♦. 

Next,  we  discuss  some  consequences  of  these  findings  for  the 
representation  of  two-phase  flows  through  tubas.  In  order  to  resolve  the 
radial  structure  of  such  flows,  the  averaging  sphere  radius  should  be  a  small 
fraction  of  the  tube  diameter,  say 


R  <  cD , 


(2.12) 


where  c  is  of  the  order  0.05  or  smaller.  (With  such  a  Rt  one  can  describe  the 
flow  in  the  core  of  the  tube  and  up  to  the  distance  cD  from  the  wall.  If 
c»0.05,  then  the  averaged  descriptor  could  be  used  to  represent  gross 
features  of  the  radial  structure.)  Assuming  e.g.,  that  JA5|t  ^*0.01,  one 


pt  -  1.0  and  6.  $67.1 
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finds  from  Figure  1,  R>2.7L„  and  therefore, 

m 


Lm  <  Dc/2.7  =  DO. 02. 


(2.13). 


Hence,  In  order  to  reasonably  represent  by  averaged  descriptors  gross  features 
of  the  radial  structure  of  the  flow,  the  mean  distance  between  the  particle 
centers  should  be  less  than  0.02  of  the'  tube  diameter.  By  relaxing  the 
requirements  on  the  tolerance  level,  one  of  course  can  arrive  at  a  less 
stringent  condition.  However,  Eq.  (2.13)  likely  indicates  the  correct  order 


of  magnitude  of  an  upper  bound  for 


A  simple  consequence  for  interior 


ballistics  modeling,  where  the  particulate  phase  consists  of  propellant 
grains,  is  that  space  averaged  descriptors  are  not  adequate  for  the  resolution 
of  radial  flow  structures,  because  the  condition  (2.13)  is  typically  violated 
in  such  flows.  More  discussions  of  the  interior  ballistics  problem  are 
presented  in  Section  5. 

Volume  averaging  smoothes  and  distorts  not  only  the  undesirable 
heterogeneities  that  are  caused  by  single  particles,  but  also  other  flow 
structures,  particularly  when  they  have  extensions  less  than  a  diameter  of  the 
averaging  volume.  An  example  of  a  flow  structure  with  a  small  extension  is 
the  boundary  of  a  particle  aggregate.  Let  the  average  gas  volume  fraction  in 
the  aggregate  be  a  and  let  the  aggregate  occupy  the  half-space  z>0.  The 
conceptual  image  of  the  boundary  z-0  is  a  narrow  transition  zone  between  a 
region  witna=l  (gas  only)  and  «=S  (gas-particle  mixture).  In  fact,  the  width 
of  the  transition  zone  equals  a  diameter  of  the  averaging  volume  and  Is, 
therefore,  large  compared  to  the  mea.^  distance  between  the  particles.  If  the 
averaging  volume  is  a  sphere  with  radius  R,  then  the  transition  profile  is 
approximately  given  by  the  following  function 


if  z  s  -R, 


*(*)  =  {l-(l-fi)(l+  f)2(2-f)i,  if  -R  <  z  <  R, 


(2.14) 


if  z  >  R. 


This  function  is  an  idealization  derived  under  the  assumption  that  undulations 
of  a  are  zero  inside  the  particle  aggregate,  but  It  is  not  a  limit  curve  for 
R-»-«.  (Such  a  limit  is  a  =  (l+5)/2  for  any  aggregate  occupying  a  half-space). 
The  real  transition  profiles  are  different  for  different  trajectories  of  the 
averaging  sphere  and  undulate  around  a  curve  approximated  by  Eq.  (2.14).  As 
shown  in  Section  4,  the  amplitude  of  the  undulations  are  bounded  by  Eq.  (2.3) 
in  which  a  is  replaced  by  c(z)  from  Eq.  (2.14). 

3.  UNDULATIONS  OF  THE  GAS  VOLUME  FRACTION 

He  consider  the  gas  volume  fraction  a  In  an  averaging  sphere  with  the 
radius  k  and  center  coordinates  We  assume  that  the  particles  are  spheres 
with  radius  s  and  arranged  in  a  three-dimensional  lattice  with-  the  lattice 
constant  L.  Then  the  particle  aggregate  is  completely  described  by 
dimensionless  geometrical  parameters  and  two  length  scales,  s  and  L.  l.jt  S  be 


the  gas  volume  fraction  of  an  infinite  averaging  volume,  i.e.,  of  the  whole 
space.  Then  one  can  define  a  mean  distance  Lm  between  the  particle  centers  by 

Lm  =  2s ( 1 -a) ” t3*1) 

(A  motivation  for  this  definition  is  given  in  Aopendix  B.)  One  can  show  that 
the  ratio  L/Lm  is  for  any  given  lattice  independent  of  s  and  a.  Therefore, 

a  particle  aggregate  in  lattice  form  is  completely  described  by  dimensionless 

geometrical  parameters,  the  ratio  L/L  ,  the  value  of  a  and  a  single  length 

scale  L. 
m 

The  gas  volume  fraction  a  of  the  averaging  sphere  depends  on  the 
parameters  of  the  aggregate,  and  on  the  location  and  size  of  the  sphere.  We 
express  this  dependence  by 


!3.2) 


where  the  function  f  is  completely  determined  by  dimensionless  parameters 
describing  the  geometry  of  the  lattice.  Next,  we  investigate  the  dependence 
of  a  on  the  arguments  x/i-m  and  R/Lm.  If  R  is  very  small,  then  a  has  the  value 

one  or  zero,  depending  whether  the  center  position  X  is  inside  or  outside  a 
particle.  As  R  increases,  a  approaches  the  value  5,  independently  of  the 
value  of  x,  i.e.,  of  the  location  of  the  averaging  sphere.  The  transition 
between  these  limits  is  illustrated  in  Figures  3  and  4.  The  two  curves  in 
Figure  3  correspond  to  two  different  positions  of  the  averaging  sphere  in  a 
square  cylinder  lattice.  (Lattices  are  defined  in  Appendix  A.)  One  of  the 
positions  is  at  the  point  of  origin  which  is  a  lattice  point  and  occupied  by  a 
particle.  Therefore,  the  corresponding  curve  starts  with  o=0  and  a  starts  to 
increase  only  as  R  becomes  larger  than  s.  The  other  curve  is  for  the 
averaging  sphere's  center  position  x/'»-m=(0.4,  0.2,  0.0).  It  starts  with  the 
value  o=l.  Both  curves  approach  the  limit  value  and  undulate  about  that 
value  as  R  increases.  Figure  4  shows  two  similar  curves  for  a  different  value 
of  a  and  in  a  different  lattice.  The  center  positions  of  the  two  averaging 
spheres  are  the  same  as  in  Figure  3.  The  curves  shown  in  Figures  3  and  4  are 
typical  for  all  computed  examples.*  The  larger  undulations  about  a  have  a 
wave  length  of  the  order  of  Lm  and  an  amplitude  that  decreases  proportionally 

to  a  negative  power  of  R/Lm.  The  curves  belong  to  a  family  with  three 
parameters,  jjamely,  the  three  components  of  the  averaging  sphere’s  center 
coordinate  X/Lm.  We  are  interested  in  the  envelopes  of  the  family  of  curves. 
The  envelopes  were  obtained  numerically  (using  a  simplex  algorithm)  for 
various  values  of  a  and  for  the  four  different  lattices  described  in  Appendix 
A.  Some  of  the  results  are  shown  in  Figures  5,  6  and  7. 


* Additional  examples  are  presented  -in  A.  K.  R.  CelmivM,  "Averaging  Effects  in 
Models  of  Three- Dimensional  Two-Phase  Flaws ,  "  PEL  Technical  Report  in 
Publication. 
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Figure  5  shows  a  regular  pattern  of  the  envelopes,  but  such  a  regularity 
was  found  to  be  an  exception.  If  one  reduces  the  sizes  of  the  particles  in 
the  same  lattice,  then  one  obtains  the  envelopes  shown  in  Figure  6.  Figure  7 
shows  envelopes  for  a  different  lattice,  but  for  the  same  limit  value  5  as  in 
Figure  6.  The  envelopes  in  the  latter  two  figures  are  typical  for  all 
calculations.  They  indicate  that  positive  and  negative  deviations  from  a  have 
in  general  the  same  magnitude.  Also  comparing  results  for  different  lattices 
and  the  same  a,  one  finds  that  none  of  the  lattices  produce  consistently 
larger  or  smaller  undulations  of  a.  An  overview  of  the  trend  of  the  extreme 
values  of  the  envelopes  can  be  obtained  by  plotting  the  extreme  values  versus 
the  radius  of  the  averaging  sphere.  Figure  8  shows  such  a  plot  in  log, log- 
scale.  The  figure  displays  the  extreme  positive  and  negative  values  of  the 
envelopes  for  four  different  lattices  but  the  same  o.  The  different  symbols 
signify  different  lattices  and  different  signs  of  the  deviations.  The  line  in 
Figure  8  represents  an  estimated  bound  of  the  deviations.  The  equation  of  the 
line  is 

M  =  0.5(1-5)S2  (R/Lm)"2.  (3.3) 


The  equation  was  determined  by  comparing  plots  of  extreme  deviations  of  a  for 
different  values  of  c.  Next,  we  discuss  the  validity  of  the  bound  (3.3). 

The  bound  (3.3)  is  based  on  sample  calculations  of  envelopes  within  the 
range  1.0<R/Lm<4.0,  for  5=0.5,  0.667  and  0.9,  and  for  the  four  lattices 

described  in  Appendix  A.  Because  the  lattices  have  quite  different  symmetries 
and  because  their  maximal  packing  densities  range  from  5min=0. 260 

to  5m<„=0*476,  one  can  assume  that  the  results  are  valid  for  all  reasonable 
min 

lattices.  It  is  possible  to  construct  symmetric  particle  arrangements  with 
undulations  in  excess  of  Eq.  (3.3),  by  clustering  the  particles  in  a  periodic 
manner.  However,  one  can  debate  whether  such  arrangements  can  be  considered 
as  particle  aggregates  with  uniform  particle  number  density.  In  any  case,  the 
estimate  by  Eq.  (3.3)  is  optimal  in  the  sense  that  additional  computations  of 
examples  can  only  make  the  bound  larger.  The  limitation  of  calculations  to 
5*0.5  was  motivated  by  practical  considerations.  The  minimum  5  is  for  the 
square  cylinder  lattice  0.476.  This  means  that  values  of  5<0.5  only  can  be 
obtained  by  special  packings  of  the  particles,  while  the  results  for  5>0.5 
have  more  general  validity.  The  upper  limit  of  6  is  one  (gas  only)  and  Eq. 
v3.3)  produces  at  this  limit  the  correct  value  45  =0.  Therefore,  Eq.  (3.3) 
likely  can  be  used  for  the  whole  range  0.5<a£l.Q.  The  range  of  R/Lm  was 

limited  from  below  to  R/L  >1  because  for  smaller  R  the  undulations  become 

m 

excessively  large  (see  Figures  5,  6  and  7).  At  R/L  =4.0,  the  amplitude  of  the 

Q  III 

undulations  approaches  the  order  of  10  ,  which  can  be  assumed  sufficiently 
small  for  most  applications. 

Eq.  (3.3)  also  can  be  used  as  a  tolerance  condition  for  R/L  .  Such  an 
application  of  the  formula  is  discussed  in  Section  2.  ?!' 


4.  GAS  VOLUME  FRACTION  PROFILES 

Let  the  particle  aggregate  occupy  the  half-space  z>0,  and  let  5  be  the 
gas  volume  fraction  in  the  half-space.  We  investigate  how  a  can  be  used  as  a 
descriptor  of  the  extension  of  the  particle  aggregate. 
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Figure  5.  Envelopes  of  Gas  Volume  Fraction  Curves 
Square  Cylinder  Lattice,  a  =  0.5 


Figure  6.  Envelopes  of  Gas  Volume  Fraction  Curves 
Square  Cylinder  Lattice,  5  =  0.9 


If  the  averaging  sphere  is  placed  outside  the  aggregate,  then  one  obtains 

the  value  0=1,  For  positions  of  the  sphere  inside  the  aggregate,  a  is 

approximately  equal  to  5.  The  boundary  z=0  of  the  aggregate  is  signified  by  a 

transition  a(z)  from  one  to  5.  Such  a  transition  is  illustrated  in  Figure  9. 

For  the  examples  shown,  the  particle  aggregate  is  a  lattice  with  particles 

placed  only  at  those  lattice  points  which  have  a  non-negative  z-coordinate. 

One  can,  therefore,  define  as  aggregate  boundary  the  plane  z=-s  (tangential  to 

the  extreme  particles).  The  four  calculated  transition  profiles  correspond  to 

two  averaging  spheres  with  radiuses  R=Lm  and  R=2Lm,  respectively,  and  two 

trajectories  for  each  sphere.  One  trajectory  is  the  z-axis  and  the  other 

trajectory  is  the  line  x/L  =0.5,  y/L  =0.5. 

m  m 

One  observes  inside  the  aggregate  thj  expected  periodic  undulations  about 
5.  The  amplitude  of  the  undulations  is  bounded  by  Eq.  (2.3),  and  the  wave 
length  is  about  1.5Lm.  (For  other  examples,  the  wave  length  was  found  to  be 

as  low  as  i.OL  .)  In  the  transition  zone,  the  undulations  are  about  a 
m 

transition  curve.  This  transition  curve  may  be  approximately  computed  by 
assuming  that  the  gas  volume  fraction  equals  5  in  those  parts  of  the  averaging 
sphere  which  are  inside  the  aggregate.  (Beyond  the  plane  z=-s  in  the  present 
example.)  The  maximum  slope  of  this  approximation  is  -(l-a)(3/2)(Lm/2R),  and 
a  formula  for  the  curve  a(z)  is  given  by  Eq.  (2.14).  One  may  estimate  the 
undulations  in  the  transition  zone  by  Eq.  (2.3)  in  which  5  is  replaced  by  the 
appropriate  value  a (z).  Test  calculations  confirm  the  validity  of  this 
estimate.  However,  because  the  aggregate  boundary  introduces  a  new  structure 
into  the  problem,  systematic  deviations  are  possible  of  a(z)  from  the  “ideal“ 
transition  curve  without  undulations.  This  is  illustrated  in  Figures  10,  11 
and  12. 

Figure  10  shows  envelopes  of  the  transition  profiles  of  Figure  9  with 
R/Lm=1.0.  The  envelopes  were  numerically  determinea  by  a  simplex  algorithm 
whereby  the  x-  and  y-coordi nates  of  the  averaging  sphere  were  treated  as  free 
parameters.  The  figure  also  shows  the  transition  curve  a(z)  with  bounds  of 
undulations  calculated  by  Eq.  (2,3)  as  described  above.  One  sees  that  the 
difference  between  both  envelopes  is  for  all  z- values  smaller  than  the 
corresponding  difference  between  the  estimated  bounds.  However,  at  the  top  of 
the  transition  curve,  both  envelopes  are  larger  than  the  curve  o(z), 
indicating  a  positive  systematic  deviation.  A  negative  systematic  deviation 
can  be  seen  in  Figures  11  and  12,  which  show  profiles  for  the  same  particle 
arrangement,  but  for  a  different  value  of  5  and  two  different  values  of  R/Lm. 

It  appears  from  the  calculated  examples  that  bound*  for  deviations  of  a 
real  profile  a(z)  from  ot(z)  (Eq.  (2.14))  can  be  estimated  by  Eq.  (2.3), 
evaluated  at  a=a(z)  (instead  of  c=a),  and  multiplied  by  the  factor  l+Ca(z)-5), 
i.e., 

1 4a |  <  (l+of(z)-a)  0.5  (l-a(z))  aftz)  (Lm/R)2  .  (4.1) 

The  formula  has  not  been  tested  systematically.  For  constanta  =  a,  it  reduces 
to  Eq.  (2.3),  but  it  increases  the  estimated  bound  within  the  transition  zone 
by  up  to  a  factor  2-5,  thus  taking  care  of  systematic  trends  in  the  transition 
profiles. 
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Figure  9.  Gas  Volume  Fraction  Profiles  at  a  Particle  Aggregate  Boundary 

Leap-Frog  Triangular  Lattice,  5  =  0.5,  s/L  =  0.3969 

R/L  =  1.0  and  2.0  m 

m 
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Figure  10.  Profile  Envelopes  and  Estimated  Bounds  of  Gas  Volume  Fraction 
Leap-Frog  Triangular  Lattice,  5  =  0.5,  R/L^  =  1.0 


5.  DISCUSSIONS  OF  RESULTS  - 


The  sample  computations  presented  in  Sections  3  and  4  show  that  the  gas 
volume  fraction  a  in  an  averaging  sphere  with  radius  R  is  a  function  of  the 
sphere's  location  even  in  a  uniform  particle  aggregate.  In  such  an  aggregate, 
the  function  undulates  about  a  limit  value  5  =  1 im  a(R)which  is  independent  of 

R-*oo 

the  sphere's  location.  The  wave  length  of  the  undulations  is  up  to  1.5Lm, 
where  is  a  mean  distance  between  particles.  The  amplitude  of  the 

undulations  is  bounded  by  Eq.  (2.3).  Using  Eq.  (2.6)  or  Figure  1,  one  can 
determine  the  minimum  size  of  the  averaging  sphere's  radius  R  for  any 
particular  application.  For  instance,  one  finds  that  for  a  tolerance  level  of 
0.01  for  the  undulations,  R/l_m  muse  be  larger  than  2.7,  and  for  a  tolerance 

level  of  0.03,  R/Lm  must  be  larger  than  1.5.  We  assume  that  the  minimum  of 

the  radius  is  determined  to  be  R^  =rL„,  and  discuss  some  consequences  of  such 
a  restriction.  nnn  111 

First,  we  notice  that  any  two-phase  flow  structures  with  extensions  less 

than  R  .  will  be  reduced  in  amplitude  and  stretched  out  to  a  size  of  2R„^„  or 
mm  r  mm 

larger.  (See,  for  instance,  the  particle  aggregate  boundary  representation  by 
a  in  Section  4.)  Consequently,  a  complete  and  accurate  representation  of  the 
flow  field  can  be  done  in  a  mesh  with  a  mesh  constant  0.5R  .  .  Any  mesh 

refinement  can  be  done  by  interpolation  in  such  a  net,  and  a  finer  net  does 

not  provide  a  more  accurate  description  of  the  flow.  The  same  applies  to 

measurements  in  a  two-phase  flow  field.  It  suffices  to  present  such 

measurements  at  intervals  of  0.5R„.  .  Ideally,  local  measurements  should  be 

mm 

made  in  a  finer  mesh,  and  the  appropriate  average  descriptors  obtained  by 
averaging  over  the  averaging  sphere.  This  also  applies  to  measurements  near 
boundaries,  where  local  measurements  should  be  averaged  and  can  be  presented  in 
a  coarse  mesh.  Boundary  values  of  average  descriptors  are  not  necessarily 
equal  to  the  local  boundary  values  and  a  mesh  refinement  in  presentation  of 
the  results  is  r.ot  needed.  (Mesh  refinement  might  be  needed  for  t  ie 

observations,  however.) 

Similar  consequences  can  be  drawn  for  the  computation  of  a  two-phase 
flow,  for  instance,  by  solving  average  flow  equations.  A  reasonable  computing 
mesh  constant  is  of  the  order  of  0*5Rm^n»  so  that  flow  structures  extending 
more  than,  say,  R  -n  can  be  represented.  Any  refinement  of  the  mesh 
below  0«5Rm^n  has  the  effect  of  interpolation.  If  flow  structures  extending 
less  than  R  appear  in  the  solution  (using  a  fine  mesh)  then  they  should  be 

interpreted  a^  numerical  artifacts  or  noise,  because  they  cannot  be 

interpreted  as  average  flow  properties. 


When  averaged  descriptors  are  used  to  represent  such  transient  two-phase 
flows  In  which  a  approaches  one  in  some  parts  of  the  flow  field,  then  one 
should  distinguish  between  the  possible  causes  for  a  approaching  one.  If  the 
reason  for  the  disappearance  of  the  particles  is  a  reduction  of  their  sizes 
(for  instance,  by  combustion),  then  the  condition  R>Rnrin=rLm  not  violated 

U-m  does  not  change.)  Therefore,  flow  regions  with  a»l  can  be  represented 
by  the  same  average  descriptors  as  the  rest  of  flow  field.  If,  however,  a 
approaches  one  because  particles  diffuse  from  the  mixed  phase  region,  then  the 


condition  ft>Rm^n=rLm  eventually  will  be  violated  (Lm  increases  due  to  the 

diffusion).  When  that  happens,  the  tolerance  level  of  the  undulations  is  not 
guaranteed  and  as  a  result  the  representation  by  averaged  descriptors  becomes 
less  accurate.  A  better  approach  in  this  case  is  to  model  the  rarified  parts 
of  the  aggregate  by  some  other  method  than  averaging,  for  Instance,  by 
individually  tracing  the  diffused  particles. 


We  consider  as  an  example  the  situation  in  an  interior  ballistics  two- 
phase  flow  where  the  particulate  phase  consists  of  propellant  grains.  Let  the 
chamber  volume  of  the  gun  be  WQ  and  the  volume  of  the  barrel  be  W.  At  the 
time  when  the  projectile  exits  from  the  barrel,  the  volume  available  for  the 
gas-particle  mixture  is  WQ+W.  Let  the  number  of  propellant  grains  be  m.  Then 
at  the  beginning  of  the  firing  cycle  the  mean  distance  between  particle 
centers  is  (see  Appendix  B) 


■(!*)' 


(5.1) 


and  at  muzzle  time  the  mean  distance  is 


U  Wo  +  w\l/3 

\n  m  / 


(5.2) 


Hence, 


KV" 


(5.3) 


Let  the  initial  gas  volume  fraction  in  the  gun  be  aQ,  and  the  initial  particle 
radius  be  sQ.  Then  Lm0=2s0  (l-aQ)  1/3  and 


L  =  2s  (1  + 


(5.4) 


Typical  for  interior  ballistics  is  aQ  between  0.4  and  0.6  and  W/WQ  is 
approximately  equal  to  10.  Therefore,  the  maximum  of  Lm/ ( 2s Q )  is  for  a 
typical  gun  between  2.64  and  3.02.  Hence,  in  order  to  represent  the  whole 
transient  firing  cycle  by  averaging  descriptors,  one  has  to  use  an  averaging 
sphere  with  the  radius  of  about  2.7L  ,  or  eight  initial  particle  diameters. 

For  most  guns,  the  diameter  of  such  an  averaging  sphere  is  of  the  same  order 
as  the  caliber  of  the  tube.  This  mea<>s  that  the  radial  flow  structure  cannot 
be  represented  by  space  averaged  descriptors.  Such  descriptors  only  can  be 
used  by  averaging  over  cross-sectional  segments  of  the  tube,  in  which  case  one 
obtains  the  representation  of  the  core  flow. 


Appendix  A 


LATTICES 

We  describe  in  this  appendix  the  four  lattices  which  were  used  to  define 
particle  positions  for  calculations  of  gas  volume  fractions. 

1.  Square  Cylinder  Lattice.  We  construct  the  lattice  &v  first  arranging 
the  particles  in  a  square  mesh  with  the  mesh  constant  L  in  the  x,y-p1ane,  and 
then  translating  the  mesh  by  multiples  of  L  in  the  z-direction.  Each  square 
thereby  generates  a  square  cylinder.  In  this  lattice,  each  particle  has  six 
neighbors  at  the  distance  L. 

2.  Triangular  Cylinder  Lattice.  The  lattice  is  constructed  by  first 
arranging  the  particles  in  the  x,y-p1ane  in  an  equilateral  triangle  mesh  with 
the  mesh  constant  L,  and  then  translating  the  mesh  in  the  z-direction  by 
multiples  of  L.  Each  triangle  thereby  generates  a  triangular  cylinder.  The 
number  of  neighbor  particles  at  distance  L  is  for  this  lattice  eight. 

3.  Leap-Frog  Square  Lattice.  This  lattice  is  constructed  by  starting 
with  ?  square  mesn  in  the  x,y-plane,  with  the  lattice  constant  L  and  the  sides 
of  the  squares  parallel  to  the  axes.  Then,  the  mesh  is  translated  by 
multiples  of  L//2“  in  the  z-direction  and  by  multiples  of  L/2  in  the  x-  and 
y-directions.  Thus,  the  pattern  is  translated  in  a  leap-frog  manner  from  one 
z-plane  co  the  next.  Each  particle  in  this  lattice  has  \2  neighbor  particles 
at  distance  L. 

4.  Leap-Frog  Tri angular  Lattice.  The  lattice  is  constructed  by  first 

arranging  the  particles  fn  an  equilateral  triangular  mesh  in  the  x,y-plane, 
with  the  mesh  constant  L  and  one  side  of  the  triangle  parallel  to  the  x-axis. 
Then  the  mesh  is  translated  in  the  z-direction  by  multiples  of  and  in 

the  y-direction  alternatively  by  +L//3-.  Thus,  the  triangular  mesh  is  shifted 

in  a  leap-frog  manner  from  one  z-plane  to  the  next.  The  number  of  neighbors 
at  distance  L  from  any  particle  is  12  in  this  lattice. 

The  minimum  value  of  the  gas  volume  fraction  (closest  packing  of  spheres) 
is  obtained  in  the  four  lattices  by  setting  the  particle  radius  s  equal  to 
L/2.  The  numerical  values  of  amin  are  as  follows: 

Square  Cylinder  5  .  =  1  -  »/6  *  0.476 

min 

Triangular  Cyliner  om-n  =  1  -  w/ (3*^jT")  *  0.395 

Ldftp~Frog  Lattices  =  1  -  «/(3/r“)  =  0.260 

Both  leap-frog  lattices  are  arrangements  with  closest  packing  *  spheres  in 
three  dimensions. 

The  relation  between  5  and  s/L  is  for  all  four  lattices  given  by 

5  * 1  -  oe-Wfs/L)3.  <A.i) 


and  the  relation  between  L  and  the  mean  distance  L  is 

m 


L/L  =  (1-5  .  ) 
m  ram' 


(A.2) 


(See  Appendix  B.)  Substituting  the  values  of  S  in  Eq.  (A.2)  one  finds 

that  the  value  of  L/L  is  between  1.24  (square  cylinder  lattice)  and  1.10 
(leap-frog  lattices).  m 


m 


'i  * 


Appendix  B 


Mean  Distance  and  Number  of  Particles 

Let  m  be  the  number  of  particles  in  an  aggregate,  W  be  the  volume 
occupied  by  the  aggregate,  and  v  be  the  volume  of  each  particle.  Then  one  can 
conceptually  assign  to  each  particle  the  fraction  W/m  of  the  aggregate  volume, 
and  represent  the  fraction  as  a  virtual  sphere  with  the  distance  Lm»  This 
diameter  we  define  as  the  mean  distance  between  the  particle  centers.  It  is 
given  by  the  formula 


W  i,  3 

m  "  6  Lm  * 


(B.l) 


The  gas  volume  fraction  5  of  the  aggregate  volume  W  is  related  to  Lm  by 


•,  -  _  mv  _  6v  ,  -3 

1  -  a  =  tt  —  • 

W  ir  m 


(B.2) 


If  the  particles  are  spheres  with  the  radius  s,  then  one  obtains  from  Eq. 
(B.2) 


1  -  S  =  8( s/L  )3. 


(B.3) 


The  minimum  value  of  5  is  obtained  if  s  is  a  maximum.  For  the  four  lattices 
defined  in  Appendix  A,  the  maximum  value  of  s  is  L/2.  Therefore,  one  obtains 
from  Eq.  (B.3)  for  the  four  lattices 


1  -  5  .  =  (L/L  l3 

min  '  '  m'  * 


(B.4) 


If  the  averaging  volume  V  is  a  sphere  with  radius  R  and  N  is  the  number  of 
particles  within  the  sphere,  then  one  obtains  from  Eq.  (B.l)  the  approximation 


* K  »<*>  •  f  l;3  ■  '2R/IV5- 


(B.5) 


The  approximation  is  due  to  the  definition  of  Lm  as  a  mean  distance  for  the 
whole  particle  aggregate,  whereas  Eq.  (B.5)  is  for  the  number  N  of  particles 
in  V(R)  which  is  only  a  fraction  c'  W.  If  the  aggregate  occupies  the  whole 
space  than  Eq,  (B.5)  is  exactly  valid  at  the  limit  R**»  : 


lim  /. 

“  R-®  V 


2R/N(R)1/3)  . 


(B.6) 
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Appendix  C 


Undulations  of  Average  Descriptors 

Let  7  be  a  local  gas  property,  for  instance,  density.  Then  the 
corresponding  average  descriptor  $  is  defined  by 


*r-  /* 


(C.l) 


where  V„__  is  that  part  of  the  averaging  volume  V,  which  is  occupied  by  gas. 

_/v  ^ 

If  <t>  is  constant,  then  QH  for  any  positive  value  of  V„  =aV.  If  6  is  not 

gas 

constant,  then  the  distribution  of  particles  in  V  does  affect  the  value  of  the 
average  descriptor  <P.  We  estimate  the  influence  of  undulations  of  the  gas 
volume  fraction  a  on  the  value  of  <p  in  the  case  where  ?  is  a  function  with  a 
constant  gradient. 

Let  $  be  the  function 


?(x,y,z)  =  <pQ  + 


(C.2) 


with  constant  <PQ  and  <{>x.  Let  the  particle  volumes  v  be  small  compared  to 
the  averaging  volume  V  and  let  x.  be  the  x-coordi nates  of  the  centers  of  the  N 
particles  that  are  in  V.  Then  the  average  descriptor  *  can  be  approximated  by 


♦(x)  =  [(Vx*x)V-£  (Wx)vj  = 


(C.3) 


where  t  is  the  average  of  the  deviations  x-x  of  the  particle  positions  x- 
from  the  center  of  x  of  V.  1  1 

Next  ,  we  relocate  the  averaging  volume  to  the  position  x+  Ax.  The 
relocation  will  in  general  change  the  number  of  particles  in  V.  Let  AH  be  the 
number  of  particles  that  are  added  (scooped  up)  by  the  relocation,  and  Am  be 
the  number  of  particles  that  are  lost  by  the  relocation.  The  total  number  of 
particles  in  V  is  changed  by  the  relocation  by 

AN  =  AM  -  Am,  , 

(C  .4) 


and  the  gas  volume  fraction  changes  by 


(C.5) 


Aa  =  -V  .  -(l-a)AN/N. 


The  new  value  of  the  the  average  descriptor  is,  within  the  same  approximation 
as  Eq.  (C.3), 


4>(x+Ax)  =  $0  +  (x+Ax)$x 


ct+Acr  X 


[<•  /AM  Am\  n 

J"4x+s(E-Ej  (x .-x-ax)j , 


(C.6) 


We  assume  that  V  i"  a  sphere  with  radius  R.  Then  the  average  x-coordinate  of 
the  added  particles  may  be  estimated  by  X+2R/3+ Ax/2,  and  the  average  x- 
coordinate  of  the  lost  particles  by  x-2R/3+Ax/2.  The  number  of  added  and  lost 
particles  in  a  uniformly  distributed  aggregate  (no  undulations  of  a)  are 


AM  =  Am  =  (l-o)AV/v  =  N*3ax/(4R), 


(C.7) 


where  aV=*V3ax/(4R)  is  the  volume  newly  covered  and  lost  by  the  relocation. 

Next,  we  assume  that  the  particle  distribution  is  not  uniform  so  that  one 
has  an  excess  of  added  particles  over  lost  ones.  We  express  this  by  setting 


AM  =  N  3ax/(4R)  +  AN,  ) 

and  >  (C.8) 

Am  =  N  3ax/(4R).  ? 


Substituting  these  values  and  the  average  coordinate  estimates  of  added  and 
lost  particles  in  Eq.  (C.6)  one  obtains 


♦(x+Ax)  =  ^  +  (x+axHx  - 
=  ♦„  ♦  (»+**>♦„  - 


(C.9) 


The  difference  between  $(x*Ax)  and  $(x)  is,  therefore, 
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4>(x+Ax)  -  4>(x)  =  Ax$x  -  4>x2R  y  [l-  |  2R  +  V  3  2R']  •  (C*10) 


The  last  term  on  the  right-hand  side  of  Eq.  (C.10)  is  the  change  of  4>  which  is 
caused  by  an  undulation  Aa  over  the  distance  Ax.  From  Section  4,  we  know 
tnat  typical  wave  lengths  of  the  major  undulations  are  between  Lm  and  1 .5Lm. 
In  order  to  obtain  the  amplitude  of  a  corresponding  undulation  of  <i>  one  should 
set  Ax  equal  to  one-fourth  of  the  wave  length.  By  setting,  e.g.,  Ax=L  /3 
one  obtains  the  following  estimate  of  the  amplitude  of  the  undulation 

<c-"> 

In  Section  2  we  found  that  a  reasonable  value  of  R  is  such  that  Lm/4R<0.25. 
Also,  the  ratio  */R  is  likely  much  smaller  than  one.  The  expression  2R$X  is 
the  change  of  <t>  along  a  diameter  of  the  averaging  volume,  if  undulations*  are 
not  present.  Let  that  change  be  6<j>  and  let  jAa|«a  .  Then  Eq.  (C.ll)  can  be 
simplified  to 

| A4> | «  1 6<J> |  Aa/(3a).  (C .  1 2) 

Undulations  of  a  also  can  affect  such  average  descriptors  which  have  a 
zero  gradient  (6«j>=0) ,  if  the  local  property  ^  is  affected  by  the  presence  of 
particles.  One  example  of  such  a  situation  is  the  average  temperature  of  a 
gas  in  which  one  places  particles  with  a  different  temperature.  If  the  gas  is 
heated  by  conduction,  then  a  short  time  later  the  gas  temperature  will  have 
changed  in  a  zone  around  each  particle,  with  a  corresponding  change  of  the 
average  temperature.  That  average  value  undulates  with  an  amplitude 
proportional  to  the  amplitude  Aa. 

We  demonstrate  this  by  considering  a  simple  model  in  which  the  gas  has  a 
constant  local  property  4>0+8b<t>  in  a  boundary  region  with  a  volume  Av  around 

each  particle,  and  the  constant  local  property  <f>  elsewhere  in  the  flow  field. 
Then  the  average  descriptor  is  0 


♦  =  V^W  [(v-N(v+Av))$0  +  NAv{*0+5b*)J  =  *0  +  VTT1 

(C.13) 

=  $0  +  d$  , 


where  d <p  is  the  change  of  the  constant  value  4>0  due  to  the  different  local 
value  in  the  boundary  region.  If  a  changes  by  Aa,  then  the  corresponding 


j*  "JKV»rv»r  *  r.*  w  Ai.-.I.  «ik  '  VXi  XIT1£S3L£^-Clfc  IT%  U"VA-'*  U%-WV  JVl^'UU.Vl.'CaiU  4JVKJ,*-* 


change  of  <fr  Is  from  Eq.  (C. 13) 


_  ►  ,  Av  Aa  _  .,  Aa 

*♦“  ’Vt?'  "d*7rr-aj 


(C.14) 


nf  th!VSJnieref<nlt0«°^ce  that  1n  tbis  examPle  the  average  descriptor 
is  define^by  6S  ^  °0t  affected  by  undulations  of  a.  The  particle  descriptor 


*ST-  AdV* 

oar  / 


par  J 

V 


(C. 15) 


where  is  the  union  of  all  particles  that  are  located  in  the  averaging 
volume.  Let  1>Q  be  a  constant  particle  property  and  *  ++6be  a  property  in  a 

descriptor*6?!0"  WUh  the  VOlUme  ^  in$1de  eaCh  particl*‘  The"  tl»  average 


*  *  Nv  [N<v-4v)*o  +  N*v(*0+V,1  *  *„  ♦  8b* 


(C.16) 


the^verage^escriptor!!*'  °f  u"d“Utions  of  °  do  not  <"f>“ence  the  value  of 


NUMERICAL  INVESTIGATION  OF  THE  STABILITY  OF  DIFFUSION 
FLAMES  NEAR  EXTINCTION  AND  IGNITION* 
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Department  of  Theoretical  and  Applied  Mechanics 
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ABSTRACT.  A  numerical  investigation  is  made  of  the  near-ignition  and 
near-extinction  characteristics  of  chambered  diffusion  flames  for  arbitrary 
Lewis  numbers.  In  particular,  for  an  S-shaped  response  curve,  both  dynamic 
ignition  and  extinction  are  found  to  occur  at  the  turning  points. 

1 .  INTRODUCTION .  While  the  stability  of  premixed  flames  has  received 
considerable  attention  in  the  mathematical  theory  of  laminar  flames  that 
has  been  developed  in  the  last  decade  or  so,  diffusion  flames  have  almost 
been  ignored.  Even  in  the  context  of  plane  flames  subjected  to  one¬ 
dimensional  disturbances  there  is  much  to  be  done,  in  particular  when  the 
response  curve  of  steady  states  ia  S-shaped  (figure  1). 

To  be  sure,  Ma talon  &  Ludford  [1]  have  considered  (numerically)  the 
near-ignition  stability  of  chambered  diffusion  flames,  but  only  for  unit 
Lewis  numbers  Lp ,LQ  of  fuel  and  oxidant.  For  Lp,  Lq  arbitrary,  even  the 
steady  states  have  only  recently  been  determined  (Choi  [2]).  Likewise, 
Buckmaster,  Nachman  &  Taliaferro  [3]  have  determined  (analytically)  the 
near-extinction  stability  characteristics  of  the  counterflow  diffusion 
flame  for  L.  =  L.  *  1.  (The  problem  is  identical  to  that  for  a  chambered 
flame.) 


In  view  of  their  effect  in  premixed  flames,  it  is  of  some  importance 
to  consider  Lewis  numbers  different  from  1,  and  that  is  the  subject  of  the 
present  paper.  We  find  (numerically)  that  there  is  no  effect  on  near¬ 
ignition  stability  of  chambered  diffusion  flames:  whatever  the  values  of 
L_,L  ,  neutral  stability  occurs  at  the  turning  point,  which  is  therefore 
the  dynamic  ignition  point.  Dynamic  extinction  also  occurs  at  the 
(corresponding)  turning  point,  but  for  a  different  reason:  for  all  values 
of  L.,ln>  stability  persists  at.  the  turning  point.  This  conclusion  contra¬ 
dicts  Buckmaster,  Nachman  &  Taliaferro  and,  moreover,  reveals  the  existence 
of  inaccessible  steady  states  that  are  stable. 

2.  GOVERNING  EQUATIONS  FOR  NEAR-IGNITION  ANALYSIS.  As  is  shown  in 
[2],  the  equation  for  the  steady  state  near  ignition  is 

,2.  U-2  L  t 

^  +  Q„  03 


where 


ts(0)  =  ts(l)  =  0,  < 

:  is  the  temperature  perturbation  in  the  flame  zone  and  Q  is  a 


given  positive  constant.  Numerics  show  that  for  each  Q  less  than  a  certain 
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Qn(Ln. /._.)»  there  are  two  solutions;  for  Q  =  Q  there  is  one;  and  for 
^.0  u  F  U 

Q  >  Qq  none. 

From  these  solutions,  the  response 


clt 


can  be  calculated,  where  4q«  1  is  a  (known)  small  positive  constant.  It 
gives  rise  to  the  upper  and  middle  branches  of  the  S-shaped  response  curve 
sketched  in  figure  1.  The  bend  is  approached  as  Q  -*Qq  and  remote  parts  of 
the  two  branches  as  Q+  0. 


Stability  analysis  involves  the  solution  of 


2  d2<J> 


In  Ip 


dn 


Q"  (l-Oe  “ft  =  0, 


(3) 


<H0)  =  HD  =  o. 

Here  X  is  the  eigenvalue:  if  the  spectrum  has  non-negative  real  part  for 
a  steady  state  tg,  that  the  state  is  stable  to  the  class  of  disturbances 
considered;  otherwise  it  is  unstable. 

It  can  be  shown  that  an  unstable  eigenvalue,  if  it  exists,  is  real. 
Moreover,  through  asymptotic  analysis  as  Q  -►  0,  the  remote  upper  branch 
of  the  S-curve  is  found  to  be  stable  while  the  remote  lower  branch  is 
unstable.  In  the  latter  case,  there  is  just  one  unstable  eigenvalue. 

3.  NUMERICAL  RESULTS  FOR  IGNITION.  In  the  work  of  Matalon  &  Ludford 
[1]  on  ignition  for  LQ  =  =  1,  the  steady  state  is  found  by  shooting  and 

then  the  stability  problem  is  tackled  by  a  Galerkin  method.  \  different 
numerical  scheme  is  adopted  here,  in  triiich  both  problems  are  solved 
simultaneously  and  discretization  of  the  differential  operator  is  in  terms 
of  Tchebytchev  polynomials.  In  addition,  a  continuation  subroutine  is 
incorporated  to  facilitate  treatment  of  the  bending  point  and  automation 
of  the  program.  The  numercal  result  obtained  by  Matalon  for  unit  Lewis 
number  has  been  recomputed  and  confirmed. 

For  general  Lewis  number,  we  reach  the  same  conclusion  as  in  the 
special  case  Lq  ~  L p  -  1,  namely, 

i)  for  all  Iq ,  I  j.  ,  the  neutrally  stable  point  is  found  to  be  exactly 
at  the  turning  point,  (i.e.  the  static  ignition  point); 

ii)  when  an  unstable  eigenvalue  exists,  there  is  exactly  one  and  It 
is  real. 


4.  GOVERNING  EQUATIONS  FOF  NEAR-EXTINCTION  ANALYSIS.  The  near- 
extinction  steady  states  (corresponding  to  the  bottom  half  of  the 
S-response)  have  also  been  described  in  [1],  The  perturbation  temperature 
satisfies  the  differential  equation 


(5) 


f  *  -Klo'T(kl5+k2-ts)<k3«+,<4-ts)e 


and  the  boundary  conditions  ■ 

f  k1c-k3B  +  0(1) 


as  £  ■+• 


k35+k][C  +  0(1) 


where  is  a  known  positive  constant, 
k3  is  a  known  negative  constant, 
k2  =  PC,  k^  ='  QC, 

P,  Q  are  known  constant, 

B,  C  are  unknown  constants. 

Like  Q  in  section  2,  the  constant  K  is  positive.  The  numerics  determine 
exactly  two  solutions  for  each  K  greater  than  a  certain  K^(Lq,L^) »  exactly 
one  for  K  =  Kq  and  none  for  K  <  Kq.  '* 

From  these  solutions  the  two  responses 


R  =*  -6  C, 
a 

where  6  «  1  is  a  (known)  small  positive  constant,  can  be  calculated 

for  each3  K  >  KQ,  thereby  generating  the  middle  and  lower  branches  of  the 
S-shaped  response  curve  in  figure  1.  The  bend  is  approached  as  K  -*■  K 
and  remote  parts  of  the  two  branches  as  K  -*■  «.  0 


The  corresponding  stability  problem  is 


“[ — T  +  a  Lo 


-1 


+  A$ 


Ke  ^yCS^YF+yFS<*>Y04yFSy0S<*'T^  * 


$_(±®)  =  $vn(±»)  =  $vr(±°°)  **  °» 


where  y0s  =  W+W*  yFS  =  W+W 


are  the  mass  fractions  corresponding  to  the  steady  state  temperature  pertur¬ 
bation  t  .  Here  A  is  the  eigenvalue;  if  -the  spectrum  has  non-negative  real 
part  forSa  steady  state  t  ,  that  state  is  stable  to  the  class  of  disturbance 
considered;  otherwise  it  is  unstable. 


It  can  be  shown  analytically  that  the  remote  lower  branch  is  stable 
for  Lq  =  L^,.  For  remote  upper  branch,  the  steady-state  response  depends 
on  whether  k^  <  h  or  k  >  %,  but  in  either  case  we  are  unable  to  extract 
any  analytical  result  about  the  stability. 

5.  NUMERICAL  RESULTS  FOR  EXTINCTION.  Due  to  the  nature  of  the 
steady-state  problem,  it  has  to  be  treated  separately  from  the  stability 
problem,  so  that  the  numerical  procedure  is  somewhat  more  complicated  than 
that  for  ignition.  The  steady  numerics  are  based  on  shooting  for-  the 
correct  value  of  C.  Thus,  the  differential  equation  (5)  is  integrated 
backwards  under  the  boundary  condition  (fib)  for  each  estimate  of  C  until 
dt 

g 

-*■  k^  as  5  -v  -».  The  constant  B  is  not  involved.  Once  the  results 

for  two  values  of  K  are  obtained,  a  continuation  subroutine  can  be 
incorporated  as  for  ignition.  For  the  stability  problem,  the  Tchebytchev 
polynomials  are  again  used  for  the  discretization  of  the  differential 
operator. 

Figure  2  gives  the  steady-state  response  for  L  ®  1  and  various 
values  of  L^.  The  holes  in  the  curves  show  where  tne  steady  state 
changes  from  being  stable  to  unstable.  In  all  cases  the  hole  lies  on 
the  upper  branch,  implying  that  the  lower  part  of  the  S  in  figure  1, 
including  a  portion  of  the  middle  branch,  is  stable.  This  result  is 
time  for  other  values  of  Lp  also,  and  we  conclude  that 

(i)  for  all  Lq, Lp,  the  neutrally  stable  point  is  above  the 
turning  point ; 

(ii)  when  an  unstable  eigenvalue  exists,  there  is  exactly  one 
and  it  is  real. 

Note  that  the  dynamic  extinction  point  still  lies  at  the  turning  point 
and  that  there  are  inaccessible  stable  states  on  the  middle  branch  of  S. 

Conclusion  (i)  contradicts  the  recent  analytical  results  [3]  of 
Buckmaster,  Nachman  &  Taliaferro  for  Lq  -  L  =>  1,  who  found  that  the 
neutrally  stable  point  lies  exactly  at  the  turning  point.  (They  consider 
the  counterflow  diffusion  flame  instead  of  the  chambered  diffusion  flame, 
but  the  stability  problems  are  identical.)  We  are  presently  trying  to 
reconcile  our  work  with  theirs.  . 
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ABSTRACT 

An  array  of  heated  rods  is  lowered  vertically  in  a  cold  water  bath  at 
a  constant  speed  V  in  order  to  quench  them  to  obtain  desired  mechanical  pro¬ 
perties.  Relative  to  the  rods,  the  water  flows  in  a  subchannel,  is  heated, 
and  boils,  while  cooling  the  rods.  A  model  is  proposed  and  studied  which 
considers  a  one  dimensicnal  flow  in  a  subchannel.  It  is  argued  that  the  heat 
release  occurs  in  a  thin  region,  where  water  is  heated  to  boiling  conditions 
and  boils  completely  to  steam.  Above  this  boiling  layer,  steam  flows  rapidly 
against  the  friction  of  the  rod  bundle.  Below  the  boiling  layer,  the  water 
flow  is  approximately  hydrostatic.  This  results  in  the  boiling  layer  moving 
at  a  constant  speed  proportional  to  V.  The  effect  of  cross  flow  (leaking 
into  or  out  of  the  channel)  is  also  investigated,  and  the  results  discussed. 

1.  INTRODUCTION  Metal  components  can  be  hardened  by  heating,  followed  by 
rapid  cooling.  The  rapid  cooling  is  often  accomplished  by  immersing  the  com¬ 
ponents  in  a  fluid  (often  water). 

In  the  problem  we  consider,  the  component  is  a  metal  rod.  For 
manufacturing  efficiency,  a  relatively  large  array  of  heated  rods  is  lowered 
lengthwise  into  the  fluid.  The  purpose  of  the  model  derived  and  analyzed  in 
this  paper  is  to  describe  the  fluid  mechanics  in  the  subchannels  of  the  rod 
array.  In  so  doing,  we  gain  some  insight  into  the  mechanisms  involved  in  the 
process,  ar.d  their  role  in  normal  and  abnormal  operation. 

An  array  of  rods  (Fig.  1  represents  a  cross  section)  is  lowered  verti¬ 
cally  into  a  cold  quiescent  fluid  at  a  constant  speed  V.  The  rods  are  assumed 
to  be  at  a  constant  uniform  initial  temperature.  We  model  the  situation  as  a 
number  of  vertical  subchannels  which  are  interconnected  through  the  gaps 
between  the  rods.  For  simplicity,  we  shall  consider  one  subchannel,  and 
assume  that  it  consists  of  several  rods,  one  interior  flow  area,  and  a  number 
of  gaps  allowing  subchannel  fluid  and  surrounding  fluid  to  mix.  See  Figure 

2.  The  analysis  presented  herein  is  based  on  the  assumption  that  the  cross- 
flow  through  the  gaps  is  small,  and  utilizes  a  perturbation  analysis  starting 
from  the  flow  in  the  subchannel  without  cross-flow. 

The  approach  taken  is  a  modification  of  one  used  by  Lahey  and  Moody 
1977,  and  Achard,  Drew  and  Lahey  1981.  Let  us  consider  one-dimensional  flow 
in  a  vertical  channel  of  cross-sectional  area  A.  We  shall  describe  the  flow 
relative  to  a  coordinate  system  fixed  in  the  rods,  with  z=Q  defining  the  inlet 
to  the  subchannel  at  the  bottom  of  the  rods.  We  shall  assume  that  boiling 
starts  at  z  =  A^(t),  and  thc.t  by  z  =  A2(t),  the  liquid  has  boiled  away.  Thus, 
the  region  0  <  z  <  A^(t)  contains  liquid  only,  the  region  A2(t)  <  z  <  L  con¬ 
tains  steam  only,  while  the  region  A^(t)  <  z  <  A2<t)  contains  a  mixture  of 
steam  and  water*  The  boiling  boundary  z  =  A^(t)  occurs  when  the  incoming 
fluid  reaches  th6  boiling  temperature  T^.  The  dryout  point  z  =  A2(t)  occurs 
when  the  two-phase  mixture  has  absorbed  enough  heat  to  boil  all  the  liquid. 
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Boiling  is  a  very  efficient  'seat  transfer  process  (Rohsenow  and  Hartnett 
1973).  In  this  problem,  if  liquid  contacts  the  heat  transfer  surface  (i.e.  the 
rod)  and  the  surface  is  at  a  temperature  above  the  boiling  temperature,  the 
liquid  will  boil  until  the  surface  temperature  is  equal  to  the  boiling  tempera¬ 
ture.  In  contrast,  in  regions  (z  >  X2(t))  where  only  steam  contacts  the  rods, 
the  heat  transfer  is  very  low.  Thus,  the  rods  lose  large  amounts  of  heat  in  a 
short  time  in  the  region  X^(t)  <  z  <  X2(t).  This  region  is  therefore  relative¬ 
ly  thin.  We  shall  assume  that  this  region  is  negligibly  thin.  We  then  have 
X^(t)  =  X 2 ( t )  =  X(t).  We  shall  refer  to  z  =  X(t)  as  the  position  or  boiling 
heat  transfer. 

We  shall  further  assume  that  the  convective  heat  transfer  to  the  liquid 
dominant  in  the  region  0  <  z  <  X(t)  (where  T  <  Tb)  is  also  efficient,  so  that 
we  can  assume  that  the  rods  and  the  liquid  are  all  at  the  ambient  liquid  tem¬ 
perature  Tq  there. 

2.  GOVERNING  EQUATIONS  Let  us  now  uiscuss  the  equations  of  governing  the 
motion  in  the  various  regions.  First,  ve  consider  the  subcooled  region 
0  <  z  <  X(t ).  The  equation  of  conservation  of  mass  is 

Ji  =  !iw(z,t),  (1) 

3z  A 

where  j  is  the  velocity  of  the  liquid,  h  is  the  gap,  and  w  is  the  cross-flow 
velocity  (positive  into  the  channel). 

The  equation  of  conservation  of  momentum  in  the  z-direction  is 

li  +  H2  »  -  i  i£  -  g  -  I _ x  +  iUl  v  »  (2) 

3t  3z  p  3z  p*A  A  c 

where  p  is  the  pressure,  g  is  the  gravitational  acceleration,  P  is  the  fric¬ 
tional  perimeter  of  the  channel,  i.e.  the  part  of  the  channel  boundary  which 
contacts  the  rods.  Also,  x  is  the  frictional  force  per  unit  area  at  this 
boundary.  Finally,  the  last  term  represents  the  rate  of  addition  of  z-momentum 
to  the  channel  fluid  due  to  the  cross-flow.  The  cross  flow  carries  with  it 
velocity  vc.  If  w<0,  we  shall  assume  that  vc  =  V.  This  assumes  that  the  fluid 
coming  into  the  channel  is  the  quiescent  fluid  outside  the  channel.  If  w<0, 
we  shall  assume  vc  =  j . 

The  frictional  force  is  modelled  by 

x  =  fp  j2  ,  (3) 

where  f  is  called  the  Faming  friction  factor.  It  is  usually  assigned  a  value 
of  f  =  0.02,  although  its  value  depends  on  the  geometry  and  the  flow  regime 
(laminar  or  turbulent). 

As  discussed,  the  energy  equations  result  in 

Tr(z,t)  =  T  ( z , t )  =  T.  (4) 

for  the  subcooled  region. 

For  the  post  dry-out  region  X(t)  <  z  <  L,  where  L  is  the  length  of  the 
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rods,  we  assume  that  the  mass  and  momentum  balances  are  similar  to  those  for 
the  subcooled  region.  Thus,  we  have 


dps  +  ^s^s  -  h  w  o 
at  3z  AS 


(5) 


H 

a 


The  momentum  equation  is 


-i£  -  pg-Mpj_  +  *21  p_v 


(6) 


3z 


3  S 


s  c 


The  assumption  of  negligibly  small  heat  transfer  between  the  steam  and 
the  rods  i<-  .  cs  in 


>’■ 


Ts  -  % 

Tr  =  T« 

An  equation  of  state  is  needed.  We  shall  take 


(7) 

(8) 


P  *  PgRTs 


(9) 


The  boiling  occurs  in  a  relatively  thin  region  around  z  -  A(t).  It  is 
important  to  conserve  mass  and  energy  across  this  thin  region.  Consider  a 
moving  control  volume  of  thickness  Az  which  straddles  z  *  A(t).  Conservation 
of  energy  in  the  fluid  stream  requires 


i 


P  C  T  A(j 
ll  0 


dA» 

dt 


-  pcATtj  -  £t*j 
s  s  b  s  £{- 


Z«=  A  -  A* 
2 


lz»  A  + 

2 


(10) 


A  +  Az 

+  /  2  Pq(z',t)dz'  - 

A  -  Az 
2 


A  +  Az 

/  2  A 

A  -  Az 
2 


fg 


r  Adz'  »o 


Here  cj cs,  and  c-  are  the  specific  heats  of  the  liquid,  steam  and  the  rods, 

A  is  the  cross  sectional  area  of  the  rods;  q  is  the  rate  of  heat  flow  from  the 
bars  to  the  fluid  per  unit  area,  and  T  is  the  rate  of  change  of  liquid  to 
steam  per  unit  volume. 
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Conservation  of  energy  for  the  reds  gives 


pcAT(-Si) 
r  r  r  r  dt 


pc  A  T  ( -  *1) 
r  r  r  r  dt 


2=  A  -  Az 
2 


z=  A  +  M 


Az 

2 


(11) 


X  +  A* 

=  /  2  Pq  dz' 

X  -  Az 
2 


Conservation  of  total  mass  in  the  control  volume  is  given  by 


p  A(j  -  3A) 
%  £  dt 


=  p  A(j  -  SLA) 
s  3  dt 


X  -  Az 
2 
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X  +  As 
2 


Lastly,  conservation  of  mass  of  liquid  in  the  control  volume  gives 


P«A(  j  ,*• 

*  *  dt 


X  - 


Ay. 

2 


X 

/  2  r  A  dz' 

X  -  Az 
2 


(13) 


Substitution  of  11-13  into  (10),  assuming  that  T  (  X  +  ^5.) 

T  ( X  -  .^5.)  =  T  and  letting  Az  0  yields  r  2 

r  2  ■* 


T0  ' 


—  «*  j(X(t),t)  Q  , 
dt 


(14) 


where 


p*A(cgTb  +  hfg  -  Cj^Tq) 

(.orcrAr(T«„  -  T0)  +  ptA(csTb  +  hfg  -  cAT0)i  (IS) 


The  momentum  jump  condition  across  the  boiling  heat  transfer  region  is 


p(X+,t)  -  p(X~,t)  +  pV<:>I  -  !i>  -  -  fi  -  0  (16) 

8  3  3  dt  *  *  *  dt 
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Dotuui,'  \KU^rx^-U!i:iiX^^x,ra!ES2^i^^i;j;^!SZC^i^l5^V-SS5jE\SS35S5iSSS: 


The  cross-flow  velocity  at  any  level  z  is  related  to  the  pressure  drop 
across  the  gap  connecting  the  cluvinei  with  its  surroundings.  Thus,  we  have 


p(0)  _  p(s)  =  Ap  =  Kp  |w|w  , 


(17) 


where  w  is  the  cross-flow  velocity  in  a  single  gap,  p^°)  is  the  pressure  out¬ 
side  the  channel,  p^S/  is  the  stagnation  pressure  inside  the  channel,  defined 
by  p(®)  »  p  +  pj^/2,  p*  is  the  density  of  the  fluid  flowing  through  the  gap, 
and  K  is  an  orifice  parameter. 


For  an  ideal  orifice,  the  parameter  K  is  given  by  the  Venturi  relation 


m 


K  =  (1  -  _£  )  , 

A? 

1 


(18) 


where  A2  is  the  gap  area,  and  Aj  is  the  unrestricted  area.  For  a  square  array 
of  reds  a|/A^  =  (h/(h+2rQ) ] 2  where  h  is  the  gap  width  and  rg  is  the  radius  of 
the  rods. 

Finally,  the  pressure  in  the  fluid  outside  the  channel  is  assumed  to  be 
hydrostatic,  so  that 


p(°)(z,t)  =  p£g(Vt-z)  +  pa  . 


(19) 


A  pressure  boundary  condition  is  needed  at  the  top  of  the  array.  We 
shall  assume  that  Bernoulli’s  equation  holds  in  the  exiting  stream.  Thus, 


hv 


pd.t)  +  %  psj32j2=l  ■  pa 


(20) 


We  shall  make  t*-To  further  approximations.  They  are 


Ps/P£  <  <  1 


(21) 


and 


|p-Pal/Pa  <  <  1  • 


(22) 


The  first  assumption  (21)  results  in  several  further  approximations.  Consider 
the  momentum  equation  in  the  post  dry-out  region  (6).  The  pressure  must  drop 
from  roughly  hydrostatic  at  z=A  to  atmospheric  at  z=L.  This  implies  that  jg 
must  be  appreciable,  so  that  Psjs2  ~  p£?  L*  Further,  examining  (12)  shows  that 


3s  >  >  ii 


(23) 


235 


for  Vt  <  z  <  L.  Integrating  eq.  (5)  for  the  two  subregions  gives 


6i  -  C(t)  +  8  _L  1  l’3/2[{L-z)  +  r  (Vt-z)]3/2 

*1  1+r  so 


for  A ,  <  z  <  Vt,  where  8  =  2.—  and 

0  3  A  K 


6j8  -  D(t)  +  (pa/p4)B  js  ir3/2a-z)3/2 

1  0 

for  Vt  <  z  <  L.  Here  pa  is  the  density  of  the  ataosphere  around 
top  of  the  bundle.  If  we  azsuae  pa  <<  p£,  we  have 


63Sl  h  D(t), 


The  velocity  will  be  continuous  at  z  *  Vt  if 


D(t)  -  C(t)  +  8  j  L~3/2(L-Vt)3/2/<1+r> 
s0 

The  pressure  is  given  by 


z 

Psq^Sq  [2p/  6j  dz'  +  6js  (z, t)  -  6j 
A  1  1 

0 

+  5p  (A+,t) 

1  0 


z 

Ps  3s  t2FJ  dz*  +  6js  (z,t) 
0  0  X  l  1 

0 

+  fip^L.t) 

for  Vt  <  z  <  l. 


With 


6p_ 


for  A  <  z  <  Vt,  and 
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6p  (L,t)  -  -  P8  j8  6js  (L,t) 
1  0  0  1 


■  -  WDM 


(39a) 

the  exposed 

(39b) 

(40) 

(A  ,t)l 
1  0 

(41a) 

(L,t)J 

(41b) 

(42a) 
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Sp  (A  ,t)  =  -  j|PO(A  ,t)6A  (t)  +  6p  (x’.t) 
AO  3z  0  1  10 


+  ap0(A  ,t)6A  (t)  -  2ps  js  6ja  (  A+,t) 
3z  0  1  S0  S0  J81  O' 


'  <FPso3so2  '  V^V*1 


“2psJs„  ICO:)  -  e  jg  L“3/2(L+rVt  .  ( i+r )  A  )3/2]  (42b) 
°0  i+r  0  0 


substituted  in  eqs.  (41),  we  have 


6p  (z,t)  =  -  PS  j3  {2FtC(t)(z-A  )  -  2  gj  L-3/2/(l+r)2 
x  CO  0  5*  0 


t(L-z)  +  r(Vt-z)] 5/2] 


+  J_  L~3/2(((L-*)  +  (Vt-z) ]  3/2 

0  1+r 


+  [(L-A  )  +  r(Vt-A  )]3/2)]l 
0  0 

+  (FPs0js02  ~ 


2PgojSoC(t)  (43a 


for  A  <  z  <  Vt,  and 


6Pi(z,t)  -  -  2pa^ja^D( t) [F(z-L)  +  1)  (43b) 
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The  initial  condition  on  A  is 

1 


6A  (0)  =  0  .  (48) 

1 


4.  DISCUSSION.  Figures  3  through  6  show  several  cases  of  A  versus  t,  for 
various  values  of  h,  r  and  L.  The  quantity  of  practical  importance  is  the 

0 

part  of  the  rods  exposed  to  water  on  the  outside  (z  <  Vt)  and  to  steam  on  the 

inside  (z  >  A(t)).  Generally,  the  maximum  fraction  of  the  rod  exposed  to  this 

extreme  situation  occurs  somewhere  in  the  middle  of  the  process.  In  Figures  4 
and  5,  however,  the  extreme  occurs  in  the  middle  of  the  process.  Figures  7 

through  10  show  the  maximum  fraction  exposed  versus  the  relative  gap  size  h/r 

0 

for  various  values  of  r  and  L.  Note  that  increasing  the  spacing  between  the 

0 

rods  decreases  the  maximum  fraction  of  the  bar  exposed. 

Let  us  discuss  the  model  and  the  process.  The  model  attempts  to  cap¬ 
ture  the  essential  physics  leading  to  extreme  behavior  in  the  quenching  fluid. 
The  two  essential  effects  resisting  the  hydrostatic  filling  of  a  fluid  sub¬ 
channel  are  friction  and  "rocketing".  Friction  is  largest  in  the  steam. 
"Rocketing"  is  a  large  momentum  change  near  the  relatively  thin  boiling 
region.  A  Bernoulli  effect  at  the  channel  outlet  assists  the  filling.  Indeed, 
in  this  model  the  effect  of  crossflow  from  the  outside  into  the  subchannel  is 
to  increase  the  Bernoulli  effect  at  the  top,  and  hence  to  raise  the  level  of 
the  liquid. 

The  model  suggests  that  decreasing  resistance  to  steam  flow  in  the 
array  will  raise  the  water  level,  and  hence  decrease  the  maximum  fraction 
exposed  to  extreme  conditions.  This  can  be  done  by  increasing  the  spacing 
between  the  bars.  This  also  will  increase  crossflow,  and  effect  which  also 
seems  to  and  the  minimization  of  maximum  fraction  exposed.  These  predictions 
concur  with  several  observations  about  the  process. 

1.  The  process  works  for  one  rod.  For  no  friction,  the  model  predicts 
A  =  Vt. 

2.  Lengthening  the  rods  and  decreasing  their  diameter  (and,  presumably, 
packing  them  more  closely  in  the  bundle)  increases  friction,  and  increases 
the  maximum  fraction  exposed.  The  process  is  known  to  work  for  shorter, 
thicker  rods,  and  has  some  problems  with  longer,  thinner  rods. 
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BEYOND  STEFAN  PROBLEMS:  THE  STRUCTURE 
OF  SHEARED  SOLIDIFICATION  FRONTS* 


F.  S.  HALL  &  G.  S.  S.  LUDFORD 
Theoretical  and  Applied  Mechanics 
Cornell  University 
Ithaca,  NY  14853 


ABSTRACT.  Determination  of  the  motion  of  a  solidification  front  into 
a  molten  metal,  a  so-called  Stefan  problem,  does  not  depend  on  detail  of 
the  solidification  process  inside  the  front.  Such  details  are  needed, 
however,  when  the  liquid  is  being  stirred  by  a  magnetic  field,  since 
they  determine  the  structure  of  the  coincident  Hartman  (viscous)  laysr, 
and  hence  the  secondary  motion  in  the  liquid.  Within  this  layer  the 
viscosity  is  variable,  due  to  the  spacial  variation  in  the  concentration 
of  solid  particles.  In  the  absence  of  a  theory  in  the  nucleation 
literature,  a  growth  law  for  the  concentration  of  solid  particles  is 
proposed  for  the  small  undercooling  that  must  occur  in  such  a  layer. 

The  (asymptotic)  analysis  is  carried  out  for  the  slow  solidification 
of  a  cylinder  of  molten  metal  that  is  being  stirred  by  a  rotating, 
uniform  magnetic  field,  but  the  results  are  of  general  validity. 

I.  INTRODUCTION.  The  propagation  of  a  solidification  fror ;  into  a 
pure  molten  metal  at  rest  is  a  consequence  of  the  solid  sending  out  shoots 
into  the  liquid.  As  a  shoot  grows  it  sends  out  its  own  shoots,  the 
process  repeating  itself  to  fc:m  a  tree-like  structure  known  as  a 
dendrite.  The  growth  of  a  dendrite  is  limited  only  by  its  neighbors 
and  the  availability  of  liquid  between  its  shoots.  A  rotating  magnetic 
field  causes  the  liquid  metal  to  flow  along  the  solidification  front  so 
as  to  break  off  the  dendritic  shoots  as  soon  as  they  form,  i.e.  while 
they  are  still  small.  These  minute  solid  particles  migrate  through  the 
shear  layer  towards  the  liquid  core,  providing  nuclei  for  the  solidifi¬ 
cation  process.  The  layer  is  therefore  viewed  as  a  gradation  of 
neutrally  buoyant  particles,  their  concentration  tending  to  zero  on  the 
liquid  side  and  to  one  on  the  solid  side. 

In  the  absence  of  a  theory  determining  the  rate  at  which  the  den¬ 
dritic  fragments  grow,  we  shall  propose  a  growth  law.  The  object  of 
this  paper  is  then  to  show  how  the  structure  of  the  viscous  layer  is 
modified  when  it  is  also  a  solidification  layer,  and  to  determine  the 
disturbance  to  the  main  core  flow  that  is  caused  by  this  viscous  layer. 


♦Supported  by  the  U.S.  Army  Research  Office. 


II.  MAGNETOHYDRODYNAMIC  STIRRING.  The  structure  of  the  viscous 
layer  is  governed  by  the  continuity  and  Navier-Stokes  equations,  the 
latter  being  modified  by  tna  Lorent-^  body  force.  In  dimensionless  forms 
these  equations  are 

v*v=0,  if^Dv/pt  ■  -yp  +  jxB  +  M~2y*{n(yv+yyT)},  (l) 

where  N  and  M  are  the  interaction  and  Hartmann  numbers,  respectively. 

We  consider  the  case  of  slow  stirring  (i.e.  N-*») ,  so  that  th|  inertia 
terms  are  neglected.  The  inverse  of  the  Hartmann  number,  M  ,  is  a 
measure  of  the  "thickness"  of  the  viscous  layer.  Allowance  has  been 
made  in  equations  (1)  for  a  variable  viscosity  (dimensionless). 

In  order  to  determine  how  the  velocity  field  varies  as  we  traverse 
this  viscous-solidification  layer,  we  need  to  know  how  the  viscosity 
varies  through  the  layer.  Certainly,  the  value  of  the  viscosity  at  any 
point  within  the  layer  is  determined  by  the  concentration  of  solid  (or 
liquid)  at  that  point.  With  this  in  mind,  we  now  consider  the  solidifi¬ 
cation  process. 

III.  SOLIDIFICATION  PROCESS.  The  mod  1  that  we  shall  adopt  is  as 
follows.  The  dendritic  shoots  that  form  at  the  solid  are  supposed  to  be 
broken  off  by  the  scouring  action  of  the  shear  flow  while  they  are  still 
small.  The  solidification  layer  therefore  consists  of  particles  whose 
concentration  tends  to  one  on  the  solid  side,  and  to  zero  on  the  liquid 
side  of  the  layer. 

In  the  absence  of  such  particles,  i.e.  for  so-called  homogeneous 
nucleation,  the  solidification  rate  depends  on  the  amount  of  the  under 
cooling  from  Te,  the  temperature  at  which  solid  and  liquid  are  in 
equilibrium.  For  each  temperature  less  than  Te,  a  potential  nucleus 
must  have  a  critical  size  in  order  to  grow,  and  it  is  a  matter  of 
determining  how  many  of  the  particles,  i.e.  clusters  of  atoms  that  are 
continually  forming  and  dissolving,  are  large  enough.  The  relation 
between  the  temperature  and  the  rate  of  increase  in  concentration  of 
solid  turns  out  to  be 

4r  (1-c)  =  kc  exp  [-  ■■-Ct-g  ),  (2) 

C  ( 1-T)  x 

where  c  is  the  liquid  concentration,  the  unit  of  temperature  is  Te,  and 
k  is  a  (very  large)  constant.  The  value  of  a,  also  a  constant,  is  1.14 
for  aluminum.  Near  the  equilibrium  temperature,  i.e.,  for  small  values 
of  1-T,  the  right  side  is  negligibly  small,  corresponding  to  the  re¬ 
quirement  of  a  large  critical  size;  as  1-T  increases  the  exponential 
slowly  increases  until  at  some  value,  11*  10  for  aluminum  (corre¬ 
sponding  to  1Q£ °K  undercooling  from  Te  =-  933°K),  the  coefficient  of  c 
suddenly  becomes  appreciable  and  solidification  occurs  in  a  fraction  of 
a  second.  One  concludes  that  there  is  a  definite  temperature  at  which 
nuclei  of  critical  size  are  present  in  sufficient  numbers  to  cause 
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almost  Instantaneous  solidification.  This  temperature  will  be  referred 
to  as  the  solidification  temperature,  Ts. 

When  extraneous  particles  are  present  to  act  as  nuclei,,  less  under¬ 
cooling  is  needed  for  solidification.  The  number  of  particles  is  in¬ 
creased,  so  that  there  are  now  more  of  critical  size  at  any  termperature . 
There  is  no  theory  of  the  nucleation  rate  in  such  circumstances,  so  ve 
propose  to  use  the  same  formula  (2)  with  a  smaller  value  of  a.  (More 
precisely,  we  may  argue  that  the  effective  change  in  free  energies  from 
solid  to  liquid  are  smaller  because  the  energies  of  the  extraneous 
particles  are  not  to  be  counted;  since  the  two  energies,  volume  and 
surface,  are  reduced  proportionately,  the  formula  follows  from  the  same 
analysis  as  for  homogeneous  nucleation.) 

In  applying  the  law  (2)  to  a  cylindrically  converging  solidification 
front,  we  will  go  to  a  frame  moving  with  the  front.  Then,  if  the  speed 
of  the  front  is  small  enough  for  the  process  to  be  considered  quasi- 
steardy  relative  to  the  front,  we  may  write 

Vndc  =  -kc  exp  f - — a-] ,  (3) 

dr  (1-T) 

where  VQ  is  the  (very  small)  dimensionless  speed,  and  the  time  derivative 
has  been  replaced  by  VQd/dr.  To  complete  the  problem  we  must  add  the 
heat  equation 


„  dT  .-lid  ,  dT.  dc 

V°dr  -  Pe  F  d?  (r  *  'KV°  dr’ 


where  K  and  Pe  are  constants  (Pe  is  the  Peclet  number,  which  is  supposed 
to  have  the  same  constant  value  in  the  solid  as  it  has  in  the  liquid; 
this  assumption,  equality  of  the  two  thermal  diffusivities,  is  easily 
removed) . 

IV.  THE  SOLIDIFICATION  LAYER.  If,  as  we  shall  suppose,  t^e  solidi¬ 
fication  front  has  a  dimensionless  thickness  comparable  to  M  ,  its 
structure  must  be  investigated  in  order  to  determine  how  the  effective 
viscosity  n  changes  from  1  to  »  in  going  from  the  liquid  to  the  solid 
side  of  layer.  Several  theories  (Mancini,  1984)  for  the  effective 
viscosity  have  been  proposed,  each  yielding  a  different  dependence  on 
concentration.  Since  we  do  net  have  enough  information  about  the 
dendritic  particles  to  select  anyone  of  these  theories,  we  shall  take 
the  simplest  relation. 

nel/c^  (5) 

as  as  illustration.  Once  c,  and  hence  n,  is  found,  we  can  determine  the 
structure  of  the  viscous  layer  (i.e.,  the  velocity  field.) 
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Whenever  there  is  a  layer,  there  must  be  a  small  parameter  e»  and 
the  question  is  to  identify  e.  The  amount  of  undercooling  is  certainly 
a  candidate,  but  the  amount  that  occurs  in  ordinary  solidification 
processes  is  too  small,  leading  to  thicknesses  much  less  than  M~  . 
However,  when  dendritic  shoots  are  prevented  from  forming,  and  only 
small  particles  are  in  contact  with  the  liquid,  the  cooling  may  be  much 
larger  (though  still  small).  Such  is  the  case  here,  and  we  shall  pro¬ 
ceed  on  the  assumption  that  the  unde^co^  •m'  is  sufficient  to  make  the 
solidification  layer  of  thickness  M  .  (Thi*.  assumption  should  be 
verified  experimentally) . 

To  determine  the  amount  of  undercooling 


e»l-Ts  (6) 

necessary  to  make  the  two  layers  of  the. same  thickness,  we  set 

a*ae?',T=T  +  eV  /2^+. . . ,  (7) 

s  i 

where  a»£,m  are  positive  constants  and  T^  represents  the  temperature 
perturbation  (about  T  )  in  the  layer.  The  exponent  in  equation  (3) 
is  then  3 


a 

(1-T)2T 


~l-2 

_  e _ £+m-3_ 

cx  ei  «t  •  •  • 

l-e  1 


(Expansion  of  the  exponent  is  similar  to  that  in  the  asymptotic  approach 
to  combustion  theory).  The  first  term  is  cancelled  by  the  pre-exponential 
factors,  i.e. 

5e£~2  -  in(K/V0)  (9) 

to  leading  order,  implying  £,<2;  the  second  is  expected  to  provide  0(1) 
variations  in  the  layer,  i.e.. 


£.+m-3  *0. 


(10) 


requiring  m<l.  It  follows  that  the  layer  variable  for  solidification 
must  be 

£*2a£en"m(l-r)  (11) 

if  a  small  temperature  gradient  &en(n>0)  outside  the  layer  is  to  be 
matched. 


Now  the  solidification  layer  is  equal  thickness  to  the  viscous  layer. 
This  amounts  to  identifying  the  solidificaion  variable  R  with  the 
appropriate  variable  R  for  the  viscous  layer.  But  the  viscous  layer  is 
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the  region  where  the  viscous  term  in  equations  (1)  is  as  important  as 
the  other  terms  (in  the  limit  as  M*»>) ;  so  that 


R=M(l-r) . 


Setting 


now  shows  that 


e  «  2a8M 


m-  -1 

where  e  measures  the  thickness  of  solidification  layer,  and  M  measures 


the  thickness  of  viscous  layer. 


Equations  (9),  (10),  and  (14)  show  that 


e1~n=2$M~1£,n(K/Vrt) . 


This  relation  is  of  some  importance  because  it  does  not  depend  on  a,  a 
quantity  not  amenable  to  experimental  determination.  More  information 
about  £  is  obtained  by  writing  the  basic  equation  (3)  and  (4)  to  leading 
order  in  the  layer  variables. 


-T  d  T  -  dc 


Here  we  have  set 


~  1-2 
a  e 


(i-'e)~  ~£n  (lc/Vo>  “  C®~n)  ine- Rn(2a3),Pe  -KV^Pe 


In  equation  (16),  Tj  has  been  replaced  by  T,  and  c  now  stands  for 
its  leading  term  in  an  expansion  of  the  type  (7b).  Equation  (17a)  is 
just  a  more  accur.u:e  version  of  the  requirement  (9),  but  equation  (17b) 
gives  an  additional  relation  for  s.  Combining  equations  (15)  and  (17b) 
determines  e in  terms  of  known  constants. 


V.  SOLUTION  OF  LAYER  PROBLEM.  There  is  no  temperat  are  gradient  in 
the  melt  to  order  ens  so  that  the  layer  equations  (16)  have  to  be  solved 
under  the  boundary  conditions 


c-*-l  ’  — ■  ->G  as  R-H-  ® 


on  the  liquid  side.  The  requirement 


►1  as  c-»0 


Mj 


is  then  automatically  satisfied,  as  is  seen  from  the  integral 


S=dT/dR  =  1-c  (20) 

of  equation  (16b).  This  expresses  nothing  more  than  the  jump  condition 
in  the  Stefan  problem. 


If  we  substitute  equation  (20)  into  the  remaining  layer  equation 
(16a),  we  obtain 


,dT  ~T 
(dR_1)  e 


(21) 


The  solution,  in  parametric  form,  is 

T=  -In  [-£n(l-S)-S] ,  R=  (1.S)  [|+4n(l-S)  ]  ’  ^  ^ 

The  second  of  the  two  equations  is  of  greater  importance  since, 
according  to  definition  (20) ,  S  is  the  solid  concentration  in  the  layer, 
and  hence,  directly  related  to  the  viscosity  r\  by  equation  (5),  i.e. 

n=(l-S)“2.  (23) 

This  formula  for  n  ,  when  substituted  into  the  Navier-Stokes  equations, 
allows  us  to  determine  the  structure  of  the  viscous  layer. 

VI.  DISCUSSION.  We  have  made  a  number  of  assumptions  in  the  preceding 
analysis  which  should  be  explored  here.  The  viscous  layer  and  the 
solidification  layer  were  required  to  have  the  same  thickness.  The 
assumption  enabled  the  structure  of  the  viscous  layer  to  be  determined 
without  reference  to  the  details  of  the  solidification  process. 

If  no  assumption  is  made  about  these  two  thicknesses,  details  of  the 
solidification  process  must  be  understood  in  order  to  determine  how  the 
effective  viscosity  varies  from  that  of  the  pure  liquid  to  infinity 
(the  value  in  the  pure  solid) .  A  complete  treatment  would  require 
theories  of 


i)  the  size  and  shape  of  the  fragements  broken  of  by  the  shear  flow. 
This  is  a  very  complicated  problem,  which  involves  relating  the  break¬ 
ing  strength  of  the  dendritic  shoots  to  the  distributed  forces  applied 
to  them  by  the  liquid.  To  our  knowledge,  no  investigations  have  been 
done  in  this  area. 

ii)  the  rate  at  which  these  dendritic  fragments  grow.  Even  the 
formation  of  nuclei  (i.e.  clusters  of  electrically  neutral  atoms)  in 
the  absence  of  extraneous  particles  must  be  reinvestigated.  In  the 
classical  neucleation  theory,  electrically  neutral  liquid  atoms  come 
together  to  from  solid  particles,  but  this  must  be  modified  in  the 
present  case.  Because  of  the  applied  rotating  magnetic  field,  currents 
are  generated;  and,  since  a  current  is  nothing  more  than  a  flow  of 
electrons,  this  implies  that  the  atoms  in  the  solidification  layer  are 
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no  longer  neutral.  The  theory  would  then  have  to  be  extended  so  as  to  take 
account  of  the*  extraneous  particles. 

ill)  the  effective  viscosity  of  the  resulting  suspension.  There  are 
almost  as  many  laws  relating  viscosity  to  concentration  as  there  are 
investigators.  Each  gives  different  results  depending,  fct  example,  op 
the  size  and  shape  of  the  particles.  We  chose  a  simple  law  relating 
viscosity  to  concentration  for  discussion  purposes  only.  However,  we  do 
not  expect  the  qualitative  nature  of  the  results  to  depend  on  the  particular 
viscosity-concentration  relation  used.  Even  if  a  theory  is  formulated  to 
describe  the  3ize  and  shape  of  the  particles,  a  more  sophisticated  and 
accurate  relation  between  viscosity  and  concentration  can  --  -.obstructed. 

We  sidestepped  the  above  three  items  because  our  main  purpose  was  to 
determine  the  qualitative  structure  of  the  viscous  layer,  and  hence  the 
disturbance  that  it  causes  in  the  core  flow.  A  fuller  treatment  of  the 
subject  is  given  by  Hall,  Ludford  &  Walker  (1984). 
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SHOCK- INDUCED  THERMAL  RUNAWAY 


T.  L.  Jackson  and  i..  K.  Kapi  la 
Department  of  Mathematical  Sciences 
Rensselaer  Polytechnic  Institute 
Troy,  New  York  12189-3590 


ABSTRACT.  Ignition  of  an  initially  cold  combustible  gas  is 
studied  when  chemical  reaction  is  switched  on  due  to  the  passage 
of  a  piston-supported  shock  of  sufficient  strength.  The  time  and 
space  history  of  the  shocked  gas  up  to  the  instant  of  thermal  runaway 
is  described,  by  using  a  combination  of  asymptotics  and  numerics. 

1.  INTRODUCTION.  Consider  a  combustible  material  confined 
to  the  half  space  x  >  0  and  capable  of  undergoing  an  exothermic 
chemical  reaction  of  the  Arrhenius  type.  Suppose  that  the  temperature 
of  the  material  is  so  low  that  it-  is  practically  inert,  and  will 
not  burn  if  left  alone.  Let  combustion  be  initiated  by  applying 
an  ignition  stimulus  at  the  boundary  x  =  0  .  It  is  of  considerable 
interest  to  obtain  a  mathematical  description  of  the  events  following 
the  application  of  the  stimulus.  In  particular,  if  a  combustion 
wave  propagating  through  the  material  is  eventually  established, 
it  is  important  to  understand  the  evolutionary  process  that  gives 
rise  to  it. 

Recently,  Kapi la  [1]  has  treated  the  case  where  burning  is 
initiated  by  applying  a  heat  flux  at  the  boundary.  Building  cn 
the  pioneering  work  of  Lifiafi  and  Williams  [2]  and  Kassoy  [3],  the 
analysis  employs  a  combination  of  large  activation  energy  asymptotics 
and  numerics,  and  is  able  to  describe  rather  completely  the  entire 
course  of  events  culminating  in  the  development  of  a  well-defined 
deflagration  wave.  A  major  drawback  of  the  study,  however,  is  that 
it  completely  ignores  the  motion  and  deformation  of  the  material, 
and  is  therefore  of  limited  value  for  gaseous  combustibles.  Applica¬ 
tion  of  heat  causes  thermal  expansion  of  the  gas,  thus  producing 
a  strong  coupling  between  gasdynamic  and  chemical  aspects  of  the 
problem,  in  one  particular  context,  such  a  coupling  has  been  studied, 
independently,  by  Blythe  [4]  and  Clarke  [5],  They  consider  a  spatial¬ 
ly  homogeneous,  weakly  reactive  atmosphere  into  which  small-amplitude 
out  rapid ly-varying  gasdynamic  disturbances  are  introduced.  Their 
work  examines  the  effect  of  chemical  heat  release  on  shock  formation 
and  the  influence  of  pressure  disturbances  on  thermal  runaway. 

The  present  analysis  is  very  much  in  the  spirit  of  [4-5]  but 
is  concerned  with  a  problem  more  like  the  gaseous  counterpart  of 
that  created  in  [1].  The  ignition  stimulus  is  modelled  by  an  impul¬ 
sively  started  piston  cioving  into  the  gas  at  a  constant  speed. 

This  creates  a  shock  wave  which  runs  ahead  of  the  piston,  and  will 
travel  at  a  constant  speed  if  the  gas  were  inert.  It  is  assumed 
here,  however,  that  the  shock  is  just  strong  enough  to  raise  the 
temperature  of  the  gas  behind  it  to  the  ignition  temperature,  so 
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that  the  chemical  reaction  is  switched  on  due  to  the  pasr-'ge  of 
the  shock.  The  purpose  of  the  analysis  is  to  describe  the  state 
of  the  gas  between  the  piston  and  the  shock,  and  to  determine  the 
extent  to  which  the  shock  is  accelerated  by  chemical  heat  release, 
up  to  the  instant  of  thermal  runaway.  Post-runaway  events  are  cur¬ 
rently  under  study. 

2.  GOVERNING  EQUATIONS  AMD  ASSUMPTIONS.  The  equations  of 
reactive  gasdynamics  for  plane,  one-dimensional,  unsteady  motion 
are  [6] 


pt  +  upx  +  pux  =  0  , 

(1) 

p(ut  +  uux)  +  ^  Px  =  0  , 

(2) 

yt  +  u7x  =  -w  » 

(3) 

P  =  pT  , 

(4) 

uTx)  -  (pt  +  upx)  =  Bpw  , 

(5) 

where 


a~ 


4 


a 


expl; 


-] 

TJ 


f*> 


(7) 


Here,  y  is  the  mass  fraction  of  the  reactant  in  the  combustible 
gas,  T  the  temperature,  p  the  density,  p  the  pressure  and 
u  the  velocity.  The  variables  y,  T,  p  and  p  have  been  made 
dimensionless  by  referring  them  to  the  corresponding  quantities 
in  the  cold,  stationary  gas  ahead  of  the  shock.  The  scale  for  veloc¬ 
ity  is  taken  to  be  the  frozen  sound  speed  in  the  cold  gas,  while 
time  is  referred  to  the  homogeneous  induction  time  of  the  shocked 
gas  at  the  instant  the  shock  is  geneioted.  The  initial  (dimension¬ 
less)  shock  speed  in  denoted  by  M0  ,  such  that  M0  -  1  *  0(1)  , 
and  y  is  the  specific-heats  ratio.  Transport  effects  are  ignored, 
and  a  one-step  first-order  Arrhenius  reaction  postulated,  with 
0  the  dimensionless  activation  energy  and  8  the  heat-release 
parameter.  The  analysis  will  proceed  on  the  assumption  that 

i.e. , 


9  »  1 


» 


e  <<  0 


(8) 


PS|IUR3ATI0N  ANALYSIS.  In  the  absence  o£  chenistry  (v  5  0), 
the  state  of  the  gao  is  given  by 


p  =  p=  T  =  y  = 


u  =  0  for  x  >  Mpt  , 


P  ~  P°*  P  p°>  T  To>  y  '  D  u  ~  uQ  for  uQt  <  x  <  H0t  ,  (9b) 

“°  are  reUted  co  M°  thr3u8h  thc  a*»ki— 


_  2rMQ  +  1  -  y 

Pc  “  y  +  i 


(y  +  i) 


(Y  -  1)  Mq  +  2 


(2y  Mp  +  1  -  Y)  {(y  -  1)  Mp  +  2? 
<Y  +  l)2 


2(Mq  -  1) 

uo  -  7 - 77—  •  UO) 


(Y  +  1)M0 


Observe,  in  particular,  that  uQ  is  also  the  sr .ed  of  the  piston. 

•  ihe  Presence  of  Tc  in  the  exponent  of  (6)  indicates  that 
in  the  limit  0  - ■  -  being  considered  here,  T0  is  the  switch-on 

evolutioUre-f°r  fhC  Che,uical  action.  In  the  initial  stages  of 

J™ V 6re^rt*  thC  SCate  °f  8hocked  gas  will  deviate 
-rom  (9b)  by  an  0(e)  amount,  as  will  the  speed  of  the  shock.  Tf 
che  perturbed  speed  of  the  shock  is  taken  to  be 

M  =  H0  +  eMj  ,  (u; 

the  conditions  (10)  show  that  the  state  of  the  gas  immediately  behind 
the  shock  will  become,  to  0(e),  y  Denina 
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All  perturbation  quantities  will  depend  upon  x  and  t  ,  excepi 
the  shock-speed  perturbation  ,  which  will  depend  upon  t  a 

Substitution  of  (13)  into  (l)-(6)  yields  the  following  eq<  „ 1  .oni 
for  the  disturbances: 

Plt  +  uo  °lx  +  Po  ulx  *  0  » 

Po<ult  +  °o  ulx>  +  7  Plx  *  0  » 

PI  11  Po  T1  +  To  PI  » 

Po(Tit  +  uo  Tlx>  "  (Plt  +  uo  Plx>  "  Po  *?1  » 

.  „  1  Ti 

yit  +  uo  yix  s  -  b  «  1  • 

The  reactant  equation  (IS)  is  uncoupled  from  the  set  (14)-(18)  1 
except  for  the  nonlinear  source  term,  governs  linearized  acoust 
in  a  steadily  moving  medium.  It  is  convenient  to  introduce  a  n 
spatial  variable  £  vie.  the  transformation 


and 


1  Ti 
_  e  1 


Also,  the  piston  is  brought  to  re3t  at  the  location  £  =  0  ,  while 
;he  shock  trajectory  is  given  by  .  E,  =  j ^  Vdt  ,  where 


M  -  u. 


V0  +  eVx  ;  Vc 


M0  -  u0 


On  using  the  relevant  expressions  from  (10),  V0  can  be  written 

£  S 

<Y  -  l)Mg  +  2  p 

VD  *  - 5 - 2 - >  .  (23: 

°  2  M2  -  y  +  1  I 

Equations  (20)  are  to  be  solved  subject  to  the  requirements  chat 
at  the  piston,  u^  vanishes  and  at  the  shock,  the  conditions  (12a-d) 
must  be  satisfied.  Since  analysis  is  only  being  carried  to  0(e) ; 
the  shock  conditions  can  be  applied  et  the  undisturbed  shock  path 
£  *  VQt  .  Henceforth  we  shall  ignore  eqn.  (21)  because  once  Tj 
is  known,  yj  can  be  determined  by  r  simple  integration  of  (21) 
under  the  condition  that  y^  vanishes  initially.  Also,  in  thci 
set  (20),  pj_,  and  uj  will  be  treated  as  the  fundamental  vari¬ 
ables,  because  is  defined  by  the  third  equation  of  (20). 

It  is  a  simple  matter  to  distill  from  (20)  the  following  single 
partial  differential  equation  for  Ti  : 


a2  v  tx 

~)e 

a  € 


This  equation  clearly  reflects  the  coupling  between  acoustic  motion 
and  thermal  explosion,  and  signals  the  presence  of  the  isothermal 
sound  speed  in  addition  to  the  usual  frozen  sound  speed.  It  is 
not,  however,  particularly  conducive  to  the  construction  of  a  solu¬ 
tion.  For  the  latter  purpose  it  is  best  to  return  to  the  set  (20) 
and  rewrite  it  in  the  characteristic  form 


a  r_.  .  ,  >  ypo  fi 

TI  (Pi  +  Y  ao  Po  5l)  =  T  e  1  > 


(25a) 


TT  -  Y  a0  P0  5lJ  *  ~T~  e  1 


(251) 


(Pc  T1  ‘  Pi)  -  Po  eTl  » 


(25c) 
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where  the  characteristic  coordinates  r  ,  s  are  defined  by 


and  the  notation 


t  +  £  rt  r  ,  t  -  £  =  s 

d>  (C  »  t )  =  4>  ( r ,  s) 


(26) 


(27> 


is  employed. 

Fig.  1  shows  the  geometry  of  the  situation,  including  a  typical 
point  P  between  the  piston  and  the  shock  and  the  three  characteris¬ 
tic  lines  passing  through  it-  It  is  a  simple  matter  to  carry  out 
numerical  integration  of  equations  (25)  along  the  characteristics. 

4.  NUMERICAL  RESULTS 


The  numerical  calculations  were  performed  at  y  s  1.4  and 
M0  =  2.646  .  Then,  the  remaining  parameters  characterizing  the 
shock  at  t  =  0  are  lour'*  to  be 

pa  =  8.000,  Oq  =  3.500,  T0  *  2.286,  uD  *  1.890, 

ac  =  1.512,  VG  *  0.500,  WD  *  0.571  . 

A  small-time  solution,  developed  analytically,  was  employed  to  begin 
integration  at  t  *  0.001  .  Integration  was  terminated  at  t  *  0.875 
when  the  development  of  large  temporal  gradients  in  the  solution 
signalled  the  imminence  of  thermal  runaway.  Figures  2-4  exhibit 
the  graphs  of  Tj,  pi  and  uj  against  5  at  various  values  of 
u  while  Fig.  5  shows  the  variation  with  time  of  the  shock-speed 
perturbation  raj  . 

As  one  would  expect,  the  greatest  temperature  rise  is  at  the 
piston  face,  where  the  residence  time  for  the  shocked  gas  is  the 
longest;  a*.  any  given  time,  temperature  decreases  monotonically 
away  from  the  piston.  It  is  instructive  to  compare  this  induction 
process  with  the  constant-volume,  spatially  homogeneous  chemical 
heating  which  would  occur  if  the  piston  were  held  stationary  snd 
the  entire  bulk  of  the  gas  were  brought  instantaneously  to  the  igni¬ 
tion  temperature  Tc  at  t  *  0  .  The  corresponding  solution,  ob¬ 
tained  from  equations  (20)  in  the  absence  of  spatial  gradient3  and 
under  nail  initial  conditions,  is 

Ti  =  -£n(l  -  yt)  , 

for  which  time- to- thermal  runaway  is  1/y  *  0.714  ,  as  against  the 
value  0.875  obtained  for  the  shock-induced  case. 

At  thermal  runaway  and  beyond,  unb^endedness  of  the  solution 
in  the  vicinity  of  the  piston  renders  the  expansions  (13)  invalid. 
Further  development  of  the  explosive  process  is  uuder  study. 
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An  expanded  version  of  this  paper  will  appear  in  the  SIAM  Journal 
on  Applied  Mathematics . 
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FIGURE  CAPTIONS 

Fig.  1.  The  flow  geometry. 

Fig.  2.  Evolution  of  temperature  perturbation  T^  . 

Fig.  3.  Evolution  of  pressure  perturbation  . 

Fig.  4.  Evolution  of  velocity  perturbation  u^  . 

Fig.  5.  History  of  shock-speed  perturbation  m'^  . 
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ABSTRACT.  A  certain  model  of  one-dimensional  detonation  waves  leads  to 
a  Stefan  problem:  the  unknown  f  satisfies  Burgers  equations  on  the  two  sides 
of  a  moving  discontinuity  at  which  it  is  given  (f^»  say)  and  the  jump  in  its 
derivative  (corresponding  to  the  exothermic  reaction)  is  prescribed.  An 
alternative  formulation  of  the  problem  can  be  obtained  by  means  of  the  Hopf- 
Cole  transformation,  which  replaced  the  Burgers  aquations  by  diffusion  -  type 
equations. 

The  problem  possess5-;  steady  solution,  the  discontinuity  moving  with 
constant  speed  and  f  depending  only  on  distance  from  it.  This  solution  is 
stable  for  a  range  of  the  parameter  f,  and  unstable  otherwise,  as  was  shown 
at  tne  First  Army  Conference,  when  preliminary  results  on  the  subsequent 
evolution  of  the  instability  were  presented.  The  instability  has  now  been 
reexamined,  using  three  computation  schemes  on  each  of  the  two  formulations 
of  the  problem,  resulting  in  the  more  definite  conclusions  presented  here. 

I .  INTRODUCTION .  Work  on  detonation  waves  by  Stewart  and  Ludford  [1] 
(presented  at  the  First  Army  Conference)  showed  that  galloping  detonations 
evolve  with  time.  We  have  now  re-examined  this  instability  with  a  more 
refined  numerical  technique.  In  addition,  using  asymptotic  analysis  for 
small  time  step  we  show  how  the  velocity  of  *’he  wave  is  determined  as  it 
evolves.  This  velocity  depends,  in  general,  only  ,a  the  gradients  and 
curvatures  on  the  two  sides  of  the  flame  sheet;  but,  when  the  gradients  are 
equal  and  opposite,  the  third  derivatives  are  involved. 

Our  model  of  a  one-dimensional  detonation  wave  [1]  leads  to 

£]fn.Xf„n. 

subject  to 

f(0,T)  =  f*  (given),  [fj  =  fn(0+,T)-fTi (0~,T)  =  -(Y+l)a2/2y 
f(-«,T)  =  0,  f (+°°»T)  =■  f+  (given), 
with  a  given  initial  condition  f(n,0)  =  G(q). 

The  unknown  f  satisfies  a  Burgers  equation  on  the  two  sides  of  the 
moving  discontinuity  (which  has  been  reduced  to  rest  in  the  ^crmulation 
above)  and  represents  the  disturbance  of  a  quiescent  state  ahead  of  the 
wave  (n  -»■  -«).  The  velocity  of  the  wave  is  given  by  k(T).  All  the  reaction 
takes  place  at  the.  moving  discontinuity,  where  f  =  f^  >  0  always.  The 


Cl) 


(2) 
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derivative  of  f  takes  a  jump  from  one  side  to  the  other  of  the  discontinuity 

(flame).  The  constants  a  and  v  (ratio  of  specific  heats)  are  assignable, 

as  is  f  (>0) . 

+ 

In  the  classical  Stefan  problem  of  ice-solidification,  the  jump  in  slope 
at  the  moving  discontinuity  is  proportional  to  k(T)  and  the  nonlinear  term 
is  missing  from  equation  (1).  The  first  of  these  ensures  that  the  velocity 
is  always  determined  by  the  gradients  and  curvatures  at  the  discontinuity, 
while  the  second  makes  no  difference. 


II.  RESULTS  FROM  STABILITY  ANALYSIS.  The  problem  as  it  stands  is  over 
determined.  If  k(T)  is  supposed  given:  there  are  three  conditions  at  rj  =  0, 
namely 

f(cr,T)  =  f*  ,  ty  -  -(r^)a2.  (3) 

The  extra  condition  determines  the  speed  k (T)  of  the  flame.  Prescribing- 
f+,  a  and  restricting  f^  to  a  certain  interval,  namely  f4c  (ct“/f^»(f^+a  )/f  )» 
yields  a  steadv  solution  with 

f 2  +  a2 

k=ks  =  “2ir—  -a*  (4) 

The  steady  solution  may  be  written 


21c  f  .e  "/[2k  -f.+fe  **] 

S  *  s  *  *r- 

fs-  ? 

[f_(f+-f*)+f+(f*-f_)e  +]/[(f+-f  *>+(«*-*>  +] 


for  n  >  0.  (5) 


provided  k  >  a ;  here 


=  (Y+l)kgn/y»  **  (y+l) (f+-f_)n/2y,  f_  *  a  /ft 


Examining  the  stability  of  the  steady-state  solution  by  setting 


k  -  kg  +  eXT,  f  =  fs(h)  +  F(n)eXT  (e  «  1) 


and  substituting  into  the  problem  (1)  and  (2),  we  find  that  the  eigen¬ 
value  X  satisfy  the  dispersion  relation 


-  «a-£*Kf»-0  «+-**>> 


where 
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kg  +  *4^+A,  fb  =  ks  ”  >^s-a2+A>  A  -  ByX/(y+1)2. 


(9) 


This  result  is  due  to  Stewart  [2],  Examination  of  this  dispersion  relation 
reveals  that  A  is  always  real  and  that  its  sign  is  as  shown  below. 
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steady  solution  exists 
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Figure  1.  Showing  the  signs  uf  the  eigenvalue.  A  <  0  corresponds  to 
stable  while  A  >  0  is  unstable. 


III.-  NUMERICAL  RESULTS.  Stewart  and  Ludford  fl]  examined  this  problem 
at  the  First  Army  Conference;  in  particular,  some  numerical  calculations  were 
made  with  =  1.50,  f+  =  0.9128  and  a  =  r 2/(y+l)  =>.9128(far  y»  7/5)  using  a 
perturbed  steady-state  profile  as  initial  conditions.  One  result  is  shown 
in  Figure  2,  and  another  (exhibiting  peaks  )  in  Figure  5. 

The  problem  was  re-examined,  using  a  more  accurate  integration  scheme. 

The  numerical  procedure  is  schematically  as  follows: 


f_  =  S(f  ,f  ,f ,k) ,' 
T  nn  n 


(10) 


f(0“,T)  =  f*,  f (=°»T)  -  f+,  f{-~,T}  =  0. 


(ID 


together  with  initial  conditions.  In  Stewart  and  Ludford  fl]  explicit 
schemes  used  centered  fifferences  in  evaluating  S;  here  a  machine  package 
was  used  to  do  this  more  accurately.  We  search  for  K(T)  until 


R  =  [f  ]  +  0.7142, 

n 

,-7 


(12) 


2, , 


was  less  than  10  '.  (Here  0.71^2  is  the  value  of  (y+1)cx  '/2y  for  the 
q,y  given  above.)  The  integration  goes  forward  in  time  until  no  value  of 
k  could  be  found  to  make  R  small  enough.  By  using  standard  IMSL  routines, 
the  repsonse  in  Figure  3  was  obtained.  At  the  point  of  breakdown*  a  plot 
of  R  against  k  immediately  before  breakdown  is  shown  in  Figure  4. 
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The  unknowns  in  the  set  (20).  are  B+  and  K*,  while  the  determinant  of 
the  system  is  G++G  .  If  G++G  does  not  vanish,  we  have  a  unique  solution 

C+  C- 

K*  =  G1  “  G~  "  7  (say) *  B+  “  °*  (21) 

+ 

(The  equality  C /G  and  C  /G  may  be  assumed  during  the  evolution  since 
otherwise  the  differential  equation  (14a)  is  violated.)  Thus  the  velocity 
of  the  wave  is  determined.  On  the  other  hand,  if  G++G  does  vanish,  the 
solution  may  be  written 


B+  =  4(Kg-V)G+//v  with  undetermined.  (22) 

To  find  K*,  we  proceed^£he  the  next  step  in  the  calculations,  i.e.  the 
determinant  of  f-(£)*  Three  algebraic  equations,  corresponding  to  the  system 
(20)  are  obtained,  whose  determinant  is  again  G+4G_;  now,  however,  there  is 
a  consistency  requirement 

K*2  -  3VK*  +  2 (T _-T+)  -  0  with  T+  -  F™  (0*)  (23) 

when  the  determinant  vanishes.  If  there  is  a  value  of  K_,  it  must  satisfy 
this  quadratic  equation;  note  that  there  are  real  roots  if  and  only  if 

T_  -  T+  jc  3V2/8.  (24) 


In  the  calculations  of  Figure  3,  it  was  observed  that  the  numerical 
integration,  failed  around  T  *  80  x  10~^ .  The  values  of  the  slopes  G+  at 
the  origin  were  found  to  have  a  sum  G^+G_  close  to  zero.  The  quadratic 

(23)  presumably,  did  not  have  real  roots,  although  third  derivatives 

were  not  obtained  accurately  enough  to  exhibit  a  violation  of  the  inequality 

(24)  . 

We  have  not  been  able  to  reproduce  Stewart  and  Ludford’s  spikes  (Figure  5) 
in  our  computations  so  far  but,  if  they  exist,  the  above  analysis  provides 
an  explanation.  Instead  of  a  cusp,  there  is  actually  a  jump  in  K  from  its 
limiting  value  as  G+4G_  -*■  0  to  a  root  of  the  quadratic  (23).  Figure  '6 
sketches  such  jumpsT  which  could  appear  to  be  cusps  if  the  resolution  in  time 
were  not  good  enough. 

The  theory  is  currently  being  tested  numerically,  by  applying  initial 
conditions  for  which  G.-*G  =  0  and  G+/G+  =  G_/C_;  with  third  derivatives 
satisfying  and  violating  the  conditions  ( 24) .  We  are  also  trying  to 
find  a  (singular)  solution  to  take  over  when'  the  condition  is  violated, 
i.e.  when  there  is  no  finite  velocity  with  which  the  discontinuity  can  move 
away. 
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Pig.  5,  Response  of  CJ-detonation  in  Figure  2  to  a  slightly 
stronger  rarefaction.  At  breakdown  the  f-profile  is  indistin 
quishable  from  that  shown  for  T  *  0.4  . 
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FINITE  INCREMENT  FORMULATION  OF  THE  PRANDTL-REUiio 
CONSTITUTIVE  EQUATIONS 
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ABSTRACT.  The  early  finite  increment  formulations  of  elastic- 
plastic  constitutive  laws  for  use  in  large  deformation  finit e-element 
codes  placed  severe  limits  on  step  size  in  order  to  avoid  significant 
error  accumulation.  In  this  paper  finite  increment  forms  of  the  consti¬ 
tutive  equations  for  Prandtl-Reuss  materials  are  investigated,,  two 
primary  sources  of  finite  increment  error  are  identified,  and  proced¬ 
ures  for  elimination  of  these  errors  are  developed.  By  adopting  a 
geometric  description  in  nine-dimensional  stress  space,  the  relevant 
stress  and  strain  increments  are  shown  to  all  lie  in  a  two-dimensional 
subspace.  This  allows  the  nature  of  the  errors  and  the  description 
of  procedures  which  overcome  them  to  be  easily  and  precisely  visualised 
with  the  aid  of  planar  vector  diagrams. 

I.  INTRODUCTION.  In  order  to  perform  deformation  and  stress 
analyses  for  a  material  being  finitely  deformed,  a  spacially  discretized 
model  of  the  continuum  usually  must  be  introduced  and  this  is  commonly 
achieved  by  the  finite-element  method.  A  technique  must  then  be  devel¬ 
oped  to  solve  the  nonlinear  governing  equations.  The  character  of  the 
solution  technique  depends  in  part  on  the  nature  of  the  material's 
constitutive  law.  An  incremental  procedure  is  appropriate  in  the  case 
of  plasticity  because  of  its  path  dependent  nature.  This  requires  that 
the  constitutive  equations  be  cast  in  finite  increment  form  but,  when 
this  is  done  in  the  simplest  and  most  direct  fashion,  two  problems 
develop.  First,  stress  points  which  should  remain  on  a  convex  yield 
surface  are  forced  to  move  finite  distances  in  a  tangential  direction 
during  each  incremental  3tep  and  thus  they  drift  outward  from  the 
yield  surface  with  each  step.  Second,  stress  points  tend  to  drift 
off  of  the  strain  hardening  curve  for  essentially  the  same  reason. 

In  beth  cases  the  finite  increment  errors  accumulate  monotonically 
since  they  are  essentially  in  the  same  direction  at  each  step.  As 
will  be  shown,  suitable  modification  of  the  incremental  constitutive 
equations  can  completely  eliminate  such  errors. 

After  defining  the  physical  variables  of  interest  and  briefly 
reviewing  the  relevant  continuum  equations,  a  geometric  characteriza¬ 
tion  of  the  deviatoric  Prandtl-Reuss  constitutive  equations  in  nine¬ 
dimensional  Kirchhoff  stress  space  will  be  developed.  Then  it  will 
be  shown  that  the  relevant  stress,  stress  rate  and  deformation  rate 
tensors  all  lie  in  c  two-dimensional  subspace  of  stress  space. 

This  naturally  leads  to  a  simple  description  in  terms  of  a  planar 
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vector  diagram.  Finite-increment  forms  of  the  Prandtl-Reuss  equations 
will  then  be  analyzed  with  the  aid  of  the  corresponding  planar  vector 
diagram. 

For  the  case  of  a  nonstrain-hardening  material,  a  geometric  descrip¬ 
tion  will  be  giv-'"'  of  finite  increment  procedures  which  insure  that 
stress  poir ts  remain  on  the  fixed  yield  surface.  The  preferred  procedure 
will  ther  -e  generalized  to  the  case  of  strain  hardening  where  the  added 
complicate*  of  drift  from  the  strain  hardening  curve  arises.  A  geo¬ 
metric  description  of  a  procedure  for  precise  *  ;acking  of  the  strain 
hardening  curve  will  be  given. 

II.  CONT-JUUM  EQUILIBRIUM  AND  COFfTITUTIVE  EQUATIONS.  In  recent 
years  it  has  become  possible  to  apply  the  finite-element  method  to  the 
difficult  problem  of  analyzing  metal  forming  processes  involving  large 
deformation,  ’iill  [1]  developed  the  governing  continuum  equations  (in 
v« fictional  form)  for  rate  independent  materials  at  finite  strain  and 
iilbbi*:*:,  Marcal  and  Rice  [  2]  developed  a  corresponding  Lagrangian 
,!lnit:*.  ■<  ii.sr.ent  procedure.  McMeeking  and  Rice  [3]  proposed  an  alterna¬ 
tive  £ 4  n  *  ■  ^-element  procedure  emphasizing  the  current  rather  than  the 
ini< '  .u  configuration  because  it  is  more  appropriate  for  analyzing 
pla-  i.i  material  behavior.  The  variational  basis  of  their  method  will 
be  given  following  a  brief  review  of  relevant  definitions  and  notation. 

Let  Xj  denote  the  three  rectangular  Cartesian  coordinates  of  a 
material  particle  in  the  current  (deformed)  configuration  and  let  Xj 
deno'L-j  its  coordinates  in  the  reference  (unde  formed)  configuration. 

The  particle's  trajectory  is  then  given  by 

*i  =  Xi  (Xi ,  X£,  X3,  t) ,  i*  1,2,3  (I) 

where  t  is  the  elapsed  time.  Taking  first  derivatives  gives  the  material 
velocity .vector  V  and  the  deformation  gradient  tensor  F 


A  second  differentiation  gives  the  velocity  gradient  r.eusor  L  whose 
symmetric  and  skew-symmetric  parts  are  the  deformation  rate  ,D  and  the 
material  spin  W,  respectively 


L 


-c 


av.*  1 

=  tF'1 
,ox  ^  /  ~  ~ 


D4W 


(3) 


The  material  mass  density  ratio  is  given  by 


-.7  —JK. rj*  rjt  7JL  ru*  ~Ji  ^  it  _TJA  r_> 
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det  F 


When  finite  deformations  are  considered,  care  must  be  taken  to 
distinguish  the  various  possible  stress  tensor  definitions.  When  the 
current  configuration  is  emphasized,  the  Kirchhoff  stress  t  and  the  true 
or  Cauchy  stress  o  are  particularly  useful.  They  are  related  by 

ct  *  t  (5) 


It  is  also  necessary  to  distinguish  various  rates  of  stress.  Constitu¬ 
tive-  laws,  for  example,  require  the  use  of  an  objective  stress  rate  but 
it  is  the  material  rate  that  must  be  integrated  to  determine  stress 
evolution.  The  material  rate  t  and  the  objective  Jaumann  rate  t  of 
Kirchhoff  stress  are  related  by  the  equation 

t*t+Wt-tW  (6) 

/v»  /v 

which  can  be  interpreted  as  a  decomposition  of  t  separating  the  effects 
of  material  deformation  and  rotation. 

The  (corrected)  rate  form  of  the  principle  of  virtual  work  developed 
by  KcMeeking  and  Rice  is 

1 1  Pu®u  '  2  V(2DlkV-\i\,J>]  *"••••  (7> 


where  integration  is  over  the  volume  of  the  current  configuration  and 
the  missing  terms  on  the  right-hand  side  account  for  the  effects  of 
prescribed  surface  tractions  and  body  forces.  It  is  the  velocity  field 
that  is  subject  to  variation  and  the  required  auxiliary  equations  are 
the  constitutive  law  in  the  form 


Tij  “  ^ijk  flu 


and  the  mass  conservation  law 


P  ="  * 


After  spacial  (finite-element)  discretization,  Eqs.(7)  and  (8)  are 
applied  incrementally  with  the  determined  velocity  field  used  to 
update  the  configuration  and  Eqs.(6)  and  (9)  used  to  update  the  material 
density  and  stress  fields  at  the  end  of  each  incremental  loading  step. 
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In  order  to  cast  the  constitutive  equations  fo.  a  tine  independent 
elastic-plastic  naterial  in  the  form  of  Eq.(8),  it  is  necessary  to 
obtain  a  rate  form  of  the  elasticity  equations  and  combine  them  with  the 
appropriate  plasticity  equations.  This  is  most  easily  accomplished  by 
making  the  kinematic  assumption  that  the  deformation  rate  can  be  linearly 
decomposed  into  elastic  and  plastic  parts 


D«De  +  Dp 

ru  A* 


(10) 


The  desired  result  is  obtained  by  substitution  from  the  elasticity 
relation 


_e* 
D  m  C  T 


dlV 


and  the  plasticity  flow  rule 


_p  1  /a f  *\  Sf 
~  *  h  \at  aT  * 


(12) 


where  h  denotes  the  tangent  modulus  of  the  strain  hardening  law  and 
f  denotes  the  yield  function  in  the  yield  criterion 

f(T)«Y, 

and  by  inverting  the  resulting  equation  to  solve  for  t. 


(13) 


For  the  case  of  isotropic  elasticity,  a  Mises  yield  criterion  and 
isotropic  strain  hardening,  the  above  procedure  produces  the  equations 
for  a  Prandtl-Reuss  material  characterised  by  two  elastic  constants, 
such  as  the  shear  modulus  G  and  the  bulk  modulus  K,  and  a  strain 
hardening  curve 

Y«Y(ep)  (14) 

where  Y  is  the  yield  stress  and  t9  is  the  equivalent  plastic  strain 
determined  by  integration  of 


!p.(f» 


P  DP  ] 

ij  ij/ 


l/< 


(15) 


Separating  the  result  into  hydrostatic  and  deviatoric  parts  gives 


*ii-®ii 


(16) 
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where  primes  denote  deviators,  r  the  equivalent  Kirchhoff  stress 


1/2 

> 


(18) 


h  the  tangent  modulus 

h  =  (19) 

dep 

and  where  8  is  unity  during  plastic  flow  (t«Y)  and  zero  otherwise.  Since 
the  situation  is  trivial  when  plastic  flow  is  not  occurring,  we  shall  only 
consider  the  case  of  plastic  flow  in  the  remainder  of  the  paper  and  thus 
P  will  be  set  equal  to  unity. 


III.  GEOMETRIC  REPRESENTATION  OF  STRESS  TENSORS.  It  is  easily 
verified  that  a  stress  tensor  a  may  be  regarded  as  a  vector  in  a  nine- 
dimensional  vector  space.  In  particular,  its  Cartesian  components  oti 
can  be  interpreted  as  nine  rectangular  Cartesian  coordinates  of  a 
point  in  a  nine-dimensional  Euclidean  space  called  stress  space.  Since 
ct  is  a  symmetric  tensor,  it  actually  lies  in  a  six-dimensional  subspace 
of  stress  space  called  reduced  stress  space.  The  usual  decomposition 
of  a  stress  tensor  into  its  deviatoric  and  hydrostatic  parts. 


7ij  *  aij  +  3  6ijakk 


(20) 


becomes  a  vector  decomposition  into  two  orthogonal  components,  the 
orthogonality  resulting  from  the  fact  that 


1 

3 


Vi*' 


(21) 


Thus  reduced  stress  space  can  be  decomposed  into  two  orthogonal  sub¬ 
spaces,  a  one-dimensional  subspace  of  hydrostatic  stresses  and  a  five¬ 
dimensional  subspace  of  deviatoric  stresses. 

A  direct  consequence  of  the  vector  representation  is  the  fact  that 
stresses  can  be  visualized  with  ordinary  vector  diagrams  whenever  they 
lie  in  a  subspace  of  dimension  three  or  less.  The  orthogonal  decompo¬ 
sition  just  mentioned,  for  example,  can  be  represented  by  a  planar 
figure  Where  crd9V  and  oj,yd  are  perpendicular  and  have  lengths  of 
(or/j  o(i  )3/s  and  respectively  (Figure  1). 


Figure  1  Decomposition  of  Stress 


Another  easily  visualized  situation  occurs  when  the  principal 
directions  of  the  stress  tensors  under  consideration  coincide  and  remain 
fixed  (in  physical  space)  so  that  the  stress  tensors  all  lii  in  a  three- 
dimensional  subspace  of  stress  space  called  principal  stress  space. 
Decomposition  of  this  subspace  into  its  hydrostatic  an.  deviatoric  parts 
leads  to  the  veil  known  II  plane.  Unfortunately;  this  representation  is 
often  misused  by  employing  it  whea  rv  ».  princ.ip.il  directions  do  not  in 
fact  remain  fixed. 

The  deviatoric  Prandtl-Reuss  equations  can  be  written  in  symbolic 
tensor  form  as 


T 


(22) 


If  we  permit  stress  rates  to  be  i. .  t resent- r *  as  sectors  in  stress  space 
(since  they  are  in  essence  stress  increments),  then  Eq.(22)  can  be 
interpreted  as  a  decomposition  of  the  stress  rate  vector  t/  into  two 
'  jmnonents,  2GD '  and  2GDP,  in  Kirchhoff  stress  space.  Since  2GDP  is 
oroporriional  to  r  ,  and  thus  the  corresponding  vectors  are  parallel, 
it  is  clear  that  all  of  the  vectors  of  interest  are  contained  in  the 
two-dimensional  subspace  spanned  by  the  stress  vector  r'  and  the  stress 
rate  vector  2GD '  and  can  be  represented  in  a  planar  diagram. 

If  a  unit  vector  a  in  the  direction  of  r‘  is  introduced,  a  more 
explicit  form  of  Eq. (2^)  is  obtained,  namely? 


*/ 

T 


l-fh/3G 


n(n  •  D7)"!  =  P(h,n)  «D' 


(23) 


where  P(h,n)  ;.csignates  a  linear  vector  operator.  Note 
a  projection  operator  in  the  case  of  perfect  plasticity 
it  projects  D'  onto  the  hyperplane  normal  to  n.  Noting 
yield  surface 


2 

3 


Y 


2 


that  P  becomes 
(h  =  0)  since 
that  the  Mises 

(24) 


becomes  a  circle  in  the  taro -dimensional  subspace,  Figure  2  is  obtained. 
Note  that  n  can  be  interpreted  as  the  yield  surface  unit  normal  vector. 


Figure  3  Finite  Increment  Prandtl-Reuss  Diagram  for  Perfect 
Plasticity  Case  (h  *  0) 


As  indicated  previously  this  result  is  unacceptable  because  the 
stress  vector  t7  +  At7  does  not  remain  on  the  yield  surface  as  it  should 
One  somewhat  crude  but  often  used  procedure  to  eliminate  this  defect, 
called  radial  return,  is  to  simply  shift  the  tip  of  the  At7  vector 
toward  the  origin  until  it  meets  the  Mises  yield  circle.  Another  is  to 
let  the  tip  of  the  At7  vector  coincide  with  the  point  of  the  yield 
circle  determined  by~ the  line  from  the  origin  to  the  tip  of  the  2GAe7 
vector.  A  much  more  rational  procedure  is  obtained  by .noting  that 
improved  accuracy  is  always  obtained  by  using  uidstep  values  of  tha 
coefficients  in  an  equation  relating  increments.  In  the  current  situa¬ 
tion  that  suggests  that  we  replace  the  initial  yield  surface  normal  n 
by  some  midstep  nonual  n,  so  that 


Ar7=2G(I-hn)  •  Ae7. 

A#  ru  A* 


In  particular;  if  we  determine  the  locus  of  the  tip  of  the  At7  vector 
for  all  possible  vectors  n  and  determine  where  that  locus  intersects 
the  yield  circle,  the  best  n  will  be  determined.  Since  Eq, (27) 
forces  At7  to  be  perpendicular  to  n,  the  desired  locus  is  a  circle  with 
the  vector  2GAe7  as  a -diameter  as  shown  in  Figure  4.  Simple  geometry 


Yield  Surface 


Figure  4  Modified  Finite  Increment  Prandtl-Reuss  Diagram 
for  Perfect  Plasticity  Case  (h  =  0) 


then  shows  that  the  n  vector  which  keeps  the  stress  vector  t/  +  At/  on  the 
yield  circle  is  directed  along  the  line  connecting  the  origin  and  the 
center  of  the  circular  locus,  i.e. 


T'  +  GAe' 

K^e'l 


(28) 


This  result  was  originally  given  by  Rice  and  Tracey  f4]  rfio  defined 
n  as  a  unit  vector  in  the  direction  of  the  average  of  t '  and  t#  +  2GAe' 
and  then  demonstrated  that  n  has  the  "remarkable  feature"  that  the 


stress  tensor  t+  At  obtained  by  using  n  in  place  of  n  satisfies  the 
yield  criterion  precisely.  Since  they  represented  stress  states  as 
vectors  in  the  n-plane,  their  result  should  have  been  restricted  to 
cases  where  principal  directions  remain  fixed  in  the  material.  However, 
we  have  shown  here  that,  if  stress  states  are  properly  represented  as 
vectors,  the  result  can  be  derived  and  is  not  subject  to  restrictions. 


The  dotted  lines  in  Figure  4  indicate  the  results  that  would  be 
given  by  Eq.(27)  if  the  yield  surface  normal  at  the  beginning  or  end  of 
the  step  were  used  for  n.  Note  that  if  radial  return  was  used  in  these 
cases,  to  force  the  stress  point  to  lie  on  the  yield  circle,  the  resulting 


Locus  of  (e^+Ae**,  t+At)  for  all  Possible  Values  of  h 


Strain  Hardening  Curve 


Figure  6  Diagram  for  Determining  Midstep  Tangent  Modulus 

Once  again  it  is  advisable  to  try  to  find  an  appropriate  midstep 
value,  this  time  for  the  tangent  modulus  h  so  that  the  stress  point 
remains  on  the  strain  hardening  curve.  This  can  be  accomplished  by 
finding  the  locus  of  the  points  (e5  4- As5 ,  t+At)  corresponding  to  all 
positive  values  of  h  and  determine  where  this  locus  intersects  the 
strain  hardening  curve. 

An  equation  for  the  locus  is  easily  determined  by  resolving  the 
vector  t,  +  At/  into  components  parallel  and  perpendicular  to  n  and 
theu  expressing  its  squared  length  as  the  sum  of  the  squares  of  its 
components.  Referring  to  Figure  5  gives  the  following  result: 

l.j'  +  Aj'|2-(s  •  <t'+2G4€')  -  |2G4£P|)‘ +  |G4c'|2-  (fi  ( 


Noting  that  the  last  two  terms  can  be  alternately  expressed  as 
It' Is  -  (n  •  t')s  and  that 

I  «w  I  Vv  /v» 

T+Af  =  ffSJl  It'  +  At'I 
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Aep  =  ^2/3  | Aep |  , 


the  equation  of  the  locus  becomes 

(t  +  At)2=  (P-  3GAep) 


_p, 2  .2  _  2 

l£F)  +  t  -  Q 


(34) 


where 


Sfl 


P  =  -y/3/2  n  •  (t /+  2GAe ') 


(3 
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Q  -  Jm  n 


(36) 


r.- 


LVj 


i 


a 

i 

i 


This  equation  relating  At  and  Aep  is  clearly  a  hyperbola.  Its  upper 
branch  intersects  the  strain  hardening  curve,  as  indicated  in  Figure  6, 
and  thus  determines  a  chord  whose  slope  is  the  desired  midstep  value 
of  the  tangent  modulus  h.  In  fact,  if  the  strain  hardening  curve  is 
specified  in  tabular  fashion,  so  that  it  is  in  effect  given  as  a  piece- 
wise  linear  function,  then  simple  closed  form  expressions  for  A e? ,  At 
and  h  =  At/ A?5  can  be  obtained. 

The  accuracy  of  this  procedure  has  been  verified  by  comparing  it3 
results  with  the  results  obtained  by  numerical  integration  for  cases 
where  D'  remains  constant  over  the  time  interval  At.  It  was  founu  to 
be  highly  accurate  even  for  large  increment  cases  where  1 2GAG7 |  =  j t' | . 

In  situations  where  a  stress  point  lies  on  the  yield  surface  but 
does  not  move  very  much  over  a  period  of  time  the  above  procedure  is 
very  stable.  When  less  accurate  procedures  are  used  for  the  strain¬ 
hardening  case  there,  is  a  tendency  for  such  a  stress  point  to  vascil- 
late  between  plastic  loading  and  elastic  unloading  in  successive  steps. 
This  can  cause  considerable  difficulty  because  the  incremental  stiff¬ 
ness  matrix  must  be  recomputed  whenever  a  stress  point  changes  from 
loading  to  unloading  or  vice  versa  and  this  change  can  cause  other 
stress  points  to  change  which  in  turn  can  cause  the  first  stress  point 
to  change  again.  In  fact  there  is  the  possibility  that  this  sequence 
of  recomputations  would  never  end,  although  we  have  never  encountered 
such  a  situation  in  practice. 

VI.  MIDSTEP  YIELDING.  Since  the  onset  of  yielding  represents  a 
discontinuity  in  material  behavior,  it  would  seem  advantageous  to 
adjust  the  fize  of  each  incremental  step  so  that  such  discontinuities 
always  occur  at  the  end  of  a  step.  However,  this  would  result  in  a 
large  number  of  very  small  steps.  Marcal  and  King  [5]  suggested  that 
such  a  limitation  on  step  size  could  be  avoided  by  allowing  midstep 
yielding  as  indicated  in  Figure  7. 
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VII.  TOTAL  STRESS  INCREMENT.  The  total  stress  increment 


Ar=rAt  (AO) 


must  be  determined  in  order  to  update  stress  at  the  end  of  each  incre¬ 
mental  step.  So  far  we  have  only  determined  At',  the  deviatoric  pgrt 
of  the  stress  increment,  corresponding  to  the  Jaumann  stress  rate  t. 

Two  additional  stress  increments  are  required,  one  corresponding  to 
the  deviatoric  part  of  f  and  another  corresponding  to  the  tensor  Wt-tW 
appearing  in  Eq. (6)  vhich  accounts  for  rotational  effects.  Since 
Wt-tW  is  deviatoric,  tue  first  of  these  two  stress  increments  is  simply 
the  increment  in  Th  yd  which,  according  to  Eq.(16),  can  be  written  in 
the  stress  vector  form 


eAT. .  =  eKD  At 
~  li  ~  xi 


(Al) 


where  e  is  the  vector  in  stress  space  corresponding  to  the  tensor  6j  3 . 
o 

If  we  let  t  designate  the  stress  vector  corresponding  to  the 
tensor  Wt-tW,  its  increment  becomes 

rv  A/' 

Art  =  rAt .  (A2) 


The  finite  increment  form  of  Eq. (6)  then  becomes 

^=ATyhd+AT/  +  ART  (A3' 

and  it  remains  to  find  an  expression  for  Z^t. 

QIt  is  readily  shown  that  is  deviatoric  and  that  Tjyd,  Tdov  (=t/) 
and  t  are  directed  in  mutually  orthogonal  directions  in  stress  space. 

In  the  absence  of  deformation  rates  T},yd  remains  fixed  and  t/  changes 
direction  but  remains  fixed  in  length  (and  thus  remains  on  a  hypersphere 
in  the  five-dimensional  stress  deviator  subspace).  If  the  initial 
value  of  t  is  used  in  Eq.(A2),  then  ArT,  is  orthogonal  to  t'  and  thus  t' 
grows  in  length  during  the  incremental  step  and  gives  another  example 
of  a  finite  increment  error  which  is  biased  so  that  it  accumulates 
monotonically  with  each  step.  The  simplest  remedy  is  to  apply  the 
radial  return  technique,  i.e.,  reduce  the  length  of  t' +  Art  without 
changing  its  direction,  at  the  end  of  each  step. 

The  more  rational  approach  of  employing  a  midstep  value  of  t  in 
Eq. (A2)  is  complicated  by  the  fact  that  t'  does  not  remain  confined  to 
a  two-dimensional  subspace  of  stress  space.  Fortunately  an  exact  solu¬ 
tion  for  the  increment  itself  can  be  obtained  for  the  case  of  constant 
spin.  The  rotation  tensor  R(t),  associated  with  a  spin  tensor  W(t')  is 
the  solution  of  the  initial  value  problem 


r«KT3K£SrSUM  W-l  WTTWWVL^TJ  KWJ  W'-  «T-  ifVi.*-— *~- -urj- 


*r^  »*  w  «f^.»j-< 


R  =  W  R  ,  R(0)  =  I  . 

/V*  <v<v  '  oi  rv 


(44) 


and  for  constant  W  that  solution,  with  the  help  of  the  Cayley-Hanilton 
theorem,  becomes 


where,  at  t  =•  At, 


R  =  e~C=  I+c,W+c0W2 

<V» 


sin(cuAt)  _  1  -  cos(tuAt) 

C1  =  U)  '  C2  =  2 

0) 


2  1  „,2. 

oj  =  -  -  tr(£  )  . 


The  exact  rotational  stress  increment  is  given  by 


t+  A,jT=  Rt  R  . 


(45) 

(46) 

(47) 


(48) 


Substitution  of  the  expression  for  R  leads  to 


*R~  ~  clh  +  c2<§2  “  C1C2<^3  +  C2$4 


where  the  four  deviatoric  tensors  (3.  are  defined  as 


h  =  2 


=  W  T  -  T  W 


/v/  <v 


B0  «  W2T+tW2  -  2W  T  W 

<V^  (V IV  ro  ^V/ 


B0  =  t  - T W)W  B.  =  W2tW2+ 

#Vj  A//W  »V  <V  «v£f  A;  A/A/  <V 


(49) 


(50) 


Note  that  the  four  terms  in  Eq.(49)  are  of  successively  higher  order 
in  At. 

In  the  general  case,  where  •r/  and  the  four  Bj  are  linearly  inde¬ 
pendent,  the  locus  of  the  stress~point  determined  by  t'  +  ArT  (for  all 
values  of  At),  which  is  a  path  on  a  hypersphere  of  radius  enters 

all  five  dimensions  of  the  stress  deviator  subspace.  On  the  other 
hand,  in  the  special  case  of  plane  strain  it  turns  out  that  £3  =  0?^ 
and  B4  =  -  l/2‘u?^g2  so  that  t#+Art  is  confined  to  the  subspace  spanned  by 
z' f  and  Jg  and  a  three-dimensional  visualization  is  possible.  In 
this  case  the  locus  of  the  stress  point  lies  on  the  intersection  of  a 
plane  and  a  sphere,  i.e.,  on  a  circle,  as  indicated  in  Figure  8. 
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Figure  8  Rotational  Stress  Increment  for  Plane  Strain  with 
Constant  Spin 

Finally,  it  should  be  pointed  out  that  the  trigonometric  functions 
in  Eq.(46)  can  be  replaced  by  rational  approximations,  for  computational 
simplicity,  without  reintroducing  the  biased  finite  increment  error. 

This  is  accomplished  by  making  the  substitution 


sin(caAt)  =  .  ,  l-cos(tuAt) 

2f2 

1 

(51) 

1+  f 

1+  f 

where 

*  .  “At 

f  =  tan  ~~2~ 

(52) 

and  then  introducing  any  rational  approximation  for  f.  The  simplest 
approximation,  f  cuAt/2,  fo-.-  example,  causes  an  angle  of  U)At  * 

0.1  tt  radians  *  18°  to  be  approximated  by  17.85°.  It  can  be  shown  that 
this  particular  approximation  is  identical  to  the  one  suggested  by 
Hughes  and  Winget  [6J. 

VIII.  SUMMARY.  A  geometric  characterization  of  stress  has  been 
used  to  describe  certain  finite-increment  errors  which  arise  when 
incremental  expressions,  that  are  only  first  order  in  At  are  inployed. 
These  errors  are  biased  in  one  direction  so  that  they  accumulate 


monotonically  in  successive  incremental  steps.  An  easily  visualized 
characterization  was  described  for  a  procedure  developed  by  Rice  and 
Tracey  [A]  to  eliminate  such  an  error,  namely  drift  from  the  yield 
surface  in  the  case  of  perfect  plasticity.  Their  technique  was  general¬ 
ized  to  include  the  case  of  strain  hardening  where  the  additional 
problem  of  drift  from  the  strain  hardening  curve  had  to  be  eliminated 
also.  The  technique  of  Marcal  and  King  [5]  to  allow  midstep  yielding 
was  then  incorporated  into  the  procedure.  Finally,  a  source  of  biased 
f ini te- increment  error  in  treating  the  effect  of  material  spin  was 
characterized  and  a  procedure  for  eliminating  it  was  given.  Although 
a  geometric  visualization  was  not  possible  in  the  general  case,  a 
three-dimensional  illustration  was  given  for  the  plane  strain  case. 
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CONSTITUTIVE  FEATURES  OF  SOLIDS 
AT  SHOCK-WAVE  LOADING  RATES 


Dennis  E.  Gradyf 

Thermorpechanical  and  Physical  Division 
Sandia  National  Laboratories 
Albuquerque,  NM  87185 


Abstract.  Solids  subjected  to  high-velocity  impact  or  explosive  loading 
exhibit  unusual  transient  and  pcrt-shock  properties  during  the  extremely  brief 
period  associated  with  the  shock-wave  risetime  and  release.  These  features  can 
include  a  unique  solid-state  shock  viscosity  behavior,  anomalous  transient  shock¬ 
hardening  effects,  heterogeneous  shear  effects  during  the  shock  risetime,  and  shock- 
induced  solid  state  and  metallurgical  transformations.  Improved  methods  in  time- 
resolved  instrumentation  have  been  critical  in  the  emerging  understanding  of  these 
constitutive  features.  An  increasing  sophistication  in  physical  and  computational 
modeling  is  required  to  incorporate  these  effects  in  applied  problems  of  dynamic 
solid  mechanics. 

I.  SHOCK- Wave  Concepts.  An  explosion,  radiation  deposition,  or  high- 
velocity  impact  can  lead  to  a  brief  but  intense  pressure  loading  of  a  solid  body 
through  the  propagation  of  a  compression  shock  wave.  Pressures  achieved  by  con¬ 
ventional  methods  range  from  a  few  to  several  hundred  GPa.  Fundamental  properties 
of  shock  waves  are  mo3t  readily  appreciated  through  consideration  of  a  normal 
one-dimensional  shook  such  as  mH’.ht  be  produced  by  the  planar  impact  of  flat 
plates  [1-3].  The  shock  transition  in  a  single  phase  material  occurs  through  the 
passage  of  a  pressure  wave  with  a  risetime  so  brief  that  it  is  usually  regarded 
as  a  discontinuity.  The  shock  state  is  characterized  by  changes  in  the  pressure, 
P,  material  velocity,  up,  specific  volume,  V,  and  specific  internal  energy,  E.  The 
kinematics  of  the  wave  are  determined  by  a  unique  shock  velocity,  l/«.  Fundamental 
laws  governing  conservation  of  mass,  momentum,  and  energy  lead  to  the  Hugoniot 
conditions  relating  variables  through  the  shock  transition, 


V/V0  =  (U.-up)/U„ 


(1) 


f  Thk  \.ork  performed  at  Sandia  National  Laboratories  supported  by  the  U.  S. 
Department  of  Energy  under  contract  number  DE-AC04-76-DP00789. 
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P  —  P,=  V.  «p/V„  (2) 


£'-£„  =  i(P  +  PoKV-V<1).  (S) 

A  thermodynamic  description  of  the  shock  transition  for  a  particular  material 
is  completed  with  a  representation  for  the  Hugoniot,  which  is  an  experimentally 
determined  pressure-volume  relation  for  the  locus  of  shock  pressure-shock  volume 
states  achieved  through  a  sequence  of  increasing  amplitude  shock  waves. 

Under  sufficiently  high  shock  pressures  in  a  solid,  material  response  hs*s  usually 
been  regarded  as  fluid  in  the  sense  that  deviatoric  stress  and  strain  values  are 
neglected  in  relation  to  hydrodynamic  values  [1-3].  In  shock  loading  to  pressures  of 
about  20  GPa  or  lower  in  metals  and  perhaps  twice  that  in  refractory  materials, 
this  assumption  is  becoming  recognized  as  a  poor  approximation  in  that  effects  of 
material  strength  can  profoundly  influence  physical  processes  [4-6].  In  addition,  the 
fact  that  a  shock  wave  has  a  brief  but  finite  risetime  cannot  always  be  ignored  in 
consideration  of  the  mechanisms  for  the  shock-induced  processes  [7,8]. 

il_  MlC!RQS.XR.U.QXllBAL  FEATVitS3JIi,£«E.  SHQCKJTflANSITION. 
Microstructural  features  in  the  deforming  solid  are  becoming  increasingly  recog¬ 
nized  as  important  to  the  stress  wave  and  flow  process.  Constitutive  modeling  efforts 
are  tending  toward  better  understanding  and  explicit  treatment  of  the  material 
microstructure  in  addressing  elastic  responses,  strength,  the  yield  process,  flow 
and  phase  transformation.  Two  microstructural  aspects  appear  important.  First 
is  the  pre-existing  microstructure  including  grain  structure,  porosity,  and  inter¬ 
nal  stress  fields.  The  second  is  a  transient  deformation  microstructure  induced 
during  the  shock  process.  The  former  microstructure  dominates  wave  propaga¬ 
tion  for  stress  amplitudes  near  the  strength  limit  of  the  materiel  but  becomes  of 
decreasing  importance  for  shocks  significantly  stronger  than  the  material  strength. 
:  .  contrast,  an  induced  deformation  micros  *ucture  achieves  increasing  importance 
ic  the  extreme  high-rate  flow  associated  with  a  strong  shock  wave.  These  microstruc¬ 
tural  effects  appear  to  play  a  significant  role  in  both  stress-wave  propagation  and 
the  material  processes  occurring  under  stress-wave  loading. 

The  pre-existing  microstructure  of  a  solid  governs  the  details  of  nonlinear 
stress  waves  which  load  the  material  beyond  the  level  of  plastic  yield  or  fracture. 
Microstructural  features  profoundly  influence  elastic  wave  propagation  through  fre¬ 
quency  dependence  and  dispersion.  In  large  amplitude  wave  propagation  such  elas¬ 
tic  response  leads  to  dispersed  or  ramped  waves  and  governs  the  loading  rate  at 
which  yield  or  other  critical  stress  levels  are  achieved.  Grain  structure,  through  size 
and  anisotropy,  affects  onset  of  plastic  flow  or  brittle  failure.  Grain  size  influences 

the  yield  stress  level  and,  under  impact  loading  rates,  can  lead  to  rate-dependent 
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yield  phenomena.  Grr.in  anisotropy  can  cause  broadening  of  the  yield  process  and 
leads  to  recoverable  elastic  response  m  multiple-wave  shock  loading  experiments 
[9j.  Porosity  is  accompanied  by  strong  local  stress  fluctuations,  providing  sites  for 
premature  yielding,  fracture  and  phase  transformation.  Large  local  deformation 
leads  to  transient  local  hot  spots  which  can  accelerate  thermally-activated  rate 
processes  under  shock  loading  Collapse  of  pore  volume  at  shock  deformation  rates 
involves  microinertial  effects,  which  can  lead  to  rate-dependent  response  of  the  bulk 
material  [10,11].  Dynamic  fracture  or  spall  is  governed  by  the  nucleation  rate  and 
growth  of  mierostructural  defects  and  flaw;;  inherent  in  the  material.  Under  impulse 
loading,  time-dependent  damage  growth  and  rate-dependent  spall  strengths  are  ob¬ 
served  [12,13].  Tensile  fracture  damage  can  occur  through  growth  and  coalescence  of 
a  population  of  microcracks  in  brittle  solids  or  through  void  nucleation  at  atomic  or 
mierostructural  defects  followed  by  catastrophic  cavitation  in  ductile  solids  [14,15]. 
The  intensity  of  damar:-  *.n  terms  o'?  the  density  of  microcracks  or  voids  per  unit 
volume  appears  to  be  controlled  by  the  loading  rate  through  a  balance  of  the  rates 
of  nucleation  and  growth,  and  micros*5  uctural  energy  effects  [16]. 

III.  HETEROGENEOUS  Shear.  As  the  amplitude  of  the  shock  wave  be¬ 
comes  increasingly  larger  than  the  strength  of  the  material,  existing  microstructure 
properties  become  less  important.  Complex  wave  structure  due  to  yielding  or  phase 
transformation  tends  to  become  overdriven  and  the  shock  wave  degenerates  toward 
an  extremely  rapid  rising  pressure  pulse,  perhaps  a  few  nanoseconds  in  duration 
and  a  few  tens  of  micrometers  in  extent.  The  width  of  the  wave  is  governed  by 
an  effective  viscous  response  of  all  of  the  dissipative  processes  occurring  within 
the  shock.  Although  gross  microstructure  such  as  porosity  continues  to  influence 
the  shock  process  through  void  volume  crushup  and  intense  shock  heating,  more 
subtle  microstructure,  such  as  grain  size  and  orientation,  and  existing  dislocation 
structure,  defects,  or  impurities,  seems  to  be  of  lesser  importance. 

Modeling  the  shock  pressure  as  a  brief,  intense,  homogeneous  deformation  from 
the  initial  to  the  shock  state  with  the  attendant  homogeneous  temperature  and 
entropy  rise  appears  to  be  too  simplistic,  however.  This  approach  is  incapable  of 
explaining  a  rich  body  of  shock-wave  phenomena,  including  electric  and  magnetic 
effects,  partial  melting  and  thermally-activated  solid  state  transformations,  shock- 
induced  chemical  changes,  and  a  host  of  metallurgical  effects.  A  large  body  of  post¬ 
shock  metallurgical  investigation  exists  in  the  literature  (see,  for  instance,  Mikkola 
[17],  Murr  [18],  Grady  et  at  [8]).  Although  fraught  with  interpretational  difficulties 
due  to  uncertainties  in  the  shock  unloading  path  and  post-shock  metallurgical 
changes,  metallographic  optical  and  transmission  electron  microscope  studies  seem 
to  indicate  that  the  shock  deformation  is  an  extremely  heterogenous  and  turbulent 
process.  A  highly  heterogeneous  deformation  process  is  further  strengthened  by 
recent  advances  in  metb  ids  for  measuring  time-resolved  stress  waves  in  solids  which 
reveai  features  in  the  shock  deformation  wave  structure  that  are  not  easily  reconciled 
with  present  theories  of  homogeneous  shock  deformation  and  high  rate  flow  [19,20]. 


Recent  theoretical  efforts  which  attempt  to  account  for  the  heterogeneous  defor¬ 
mation  process  during  shock  loading  reveal  that  significantly  higher  than  average 
temperatures  may  persist  briefly  in  deformation  zones  within  the  shock  wave  [7,21,22], 
and  calculations  indicate  local  temperature  rises  of  a  few  hundred  to  a  few  thousand 
degrees  Kelvin,  depending  on  the  magnitude  of  dissipation,  the  thermal  conduc¬ 
tivity,  and  the  mass  fraction  of  intensely  deformed  material.  Such  temperatures 
are  sufficient  to  complete  phase  transformation  by  thermal  activation  within  the 
shock  risetime  and  localized  melting  within  the  shock  can  occur  [23].  Dimensions 
of  shear  zones  and  temperature  gradients  expected  to  occur  after  passage  of  the 
shock  wave  suggest  extremely  rapid  cooling  rates  (on  the  order  of  10n  K/s),  capable 
of  quenching  transformed  material  and  submicrostructure  within  the  high-pressure 
shear-banded  material. 

IV.  SHOCK-WAVE  Viscosity.  An  important  aspect  of  the  microstructure 
and  material  property  changes  which  occur  during  passage  of  a  shock  wave  is  the 
time  duration  within  which  they  must  occur.  Improved  methods  for  measuring 
time-resolved  profiles  show  that  shock  compression  is  not  discontinuous  but  occurs 
within  one  to  a  few  hundred  ns  over  the  stress  range  of  about  1  to  10  GPa  with  the 
risetime  decreasing  rapidly  with  increasing  stress  amplitude.  Irreversible  deforma¬ 
tion  processes  occur  within  the  plastic  portion  of  the  wave.  The  plastic  wave  can 
change  with  time  during  the  early  evolution  of  the  profile  but  achieves  a  steady 
shape  after  a  short  propagation  distance  due  to  a  balance  between  the  nonlinearity 
of  the  material  compression  behavior  and  rate-dependent  dissipative  processes  which 
tend  to  disperse  the  wave. 

In  the  hydrodynamic  approximation  of  shock  compression,  a  viscous  relation 
has  been  found  useful  in  characterizing  material  behavior.  In  more  general  elastic- 
plastic  response,  a  more  complicated  viscous  behavior  is  expected.  It  has  been  useful, 
however,  to  classify  the  dissipation  over  a  steady-wave  shock  compression  process 
by  an  effective  viscosity.  The  viscosity  coefficient  is  quantified  experimentally  as 
the  ratio  of  the  maximum  viscous  stress,  which  is  proportional  to  the  maximum 
difference  between  the  Rayleigh  line  and  the  Hugoniot,  and  the  strain  rate  from  the 
maximum  slope  of  the  wave  profile. 

Recently,  steady-wave  profile  data  on  a  number  of  metals  and  nonmetals  have 
been  examined  for  risetime  behavior  [24].  A  plot  of  steady-wave  stress  jump  against 
strain  rate  for  materials  which  include  copper,  aluminum,  beryllium,  iron,  quartz, 
and  magnesium  oxide  indicate  unexpected  consistencies.  Strain  rate  increases  as 
the  fourth  power  of  the  stress  jump  for  all  material  examined.  This  implies  that 
the  shock  viscosity  decreases  as  the  square  root  of  strain  rate  exhibiting  a  non- 
Newtonian  behavior. 

Swegie  [24]  has  incorporated  a  square  root  viscous  relation  within  a  general 
Maxwell-like  plasticity  model  and  has  readily  reproduced  the  work  hardening  and 
steady-wave  response  observed  experimentally.  These  calculations  were  performed 
without  including  artificial  viscosity. 
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Factors  governing  shock  viscosity  and  risetime  of  the  plastic  wave  have  not  yet 
been  determined.  Viscous  flow  should  be  associated  with  the  microscopic  process  of 
dislocation  multiplication  and  motion,  vacancy  production,  precipitate  alteration, 
etc.  There  are  tentative  indications,  however,  that  shock  wave  risetimes  and  viscosity 
are  governed  by  more  fundamental,  mechanism  independent,  energy  principles.  If 
so,  the  microscopic  shock  process  would  be  an  effect  rather  than  a  cause,  occurring 
in  the  most  energetically  favorable  way,  consistent  with  the  time  constraints.  This 
idea  is  speculative,  but  it  is  clear  that  a  better  understanding  is  necessary  here 
before  a  comprehensive  theory  of  the  shock  deformation  process  will  emerge. 


V.  Anomalous  Shock  Hardening.  Recent  measurements  of  plastic 
wave  profiles  in  metals  such  as  aluminum  [4],  copper  [25],  and  beryllium  [26], 
indicate  strength  properties  at  the  Hugoniot  state  and  viscous  effects  within  the 
shock  front  which  are  unique  in  behavior  and  not  readily  explained  [4,8,25,26]. 
Attempts  to  rationalize  metallographic  studies  of  shocked  samples,  which  indicate 
strong  heterogeneities  in  the  microscale  deformation  with  the  very  high  rate  of 
flow  determined  from  the  measured  wave  profiles,  indicate  that  adiabatic  shear 
deformation  and  thermal  trapping  may  play  an  important  role  here  also. 

A  unique  shock  wave  experiment  in  which  a  second  unloading  or  reshock  wave 
is  passed  through  the  metal  within  microseconds  after  the  initial  deformation  shock 
wave  reveals  further  elastic-plastic  response  with  significantly  enhanced  material 
strength.  It  is  difficult  to  explain  this  effect  without  a  thermal  mechanism.  Usual 
concepts  of  plastic  flow  suggest  that  the  Hugoniot  stress  state  should  reside  cn  the 
yield  surface.  The  data  show  only  a  small  residua]  state  of  shear  stress  relative 
to  the  strength  at  Hugoniot  states  greater  than  about  5  GPa.  A  transition  from 
normal  elastic-plastic  response  to  the  observed  anomalous  behavior  appears  to  occur 
at  about  this  stress  level.  A  reduced  state  of  shear  stress  on  the  Hugoniot  and 
enhanced  strength  would  be  expected  if  the  flow  stress  were  small  and  if  some  rapid 
strength  recovery  mechanism  were  operating  during  the  microsecond  or  less  before 
the  strength  is  tested  with  a  release  or  reloading  wave.  Heterogeneous  deformation 
and  thermal  trapping  during  the  high-rate  deformation  process  would  be  expected 
to  cause  reduced  flow  stress,  and  microscale  thermal  quenching  after  passage  of  the 
shock  wave  could  provide  the  recovery  process.  Such  an  explanation  has  yet  to  be 
verified,  although,  model  calculations  indicate  local  shear  temperatures  consistent 
with  the  interpretation  [7]. 


YL  -Shock-Induced  Phase  Transformation.  Processes  of  coherent 
phase  transformation  occurring  under  shock-wave  compression  provide  a  strik¬ 
ing  example  of  phenomena  affected  by  microstructures.  Coherent  transformation 
processes  include  recrystallization  and  twinning,  coherent  precipitation,  and  dis- 
placive,  martensitic  or  semi-reconstructive  transformations  in  solids,  although  all 
of  these  have  not  yet  been  observed  under  shock  loading.  These  processes  are  com¬ 
monly  reversible  or  exhibit  little  hysteresis  in  the  transition  between  states  which 
occur  through  the  motion  of  a  coherent  interface.  Further,  nonhydrcstatic  stresses 
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markedly  influence  tbe  conditions  of  phase  coexistence  both  in  the  bulk  and  on 
the  microscale  where  structural  defects  provide  sites  of  second-phase  uucleation. 
Paterson  [27]  has  reviewed  the  theoretical  development  of  nonhydrostatic  thermo¬ 
dynamics  applicable  to  coherent  transformation  prior  to  1973,  and  several  authors, 
including  Kamb  [28],  Fletcher  [29],  and  Robin  [30],  have  noted  the  difficulty  in  es¬ 
tablishing  generalized  thermodynamic  potentials  independent  of  the  specific  process. 
They  note  that  coexistence  conditions  depend  on  the  coherent  interface  orientation 
with  respect  to  crystal  axes  and  interface  accommodated  stress  discontinuities  in 
those  components  not  required  for  stress  equilibrium. 

The  I-H  transformation  in  the  naturally  occurring  mineral  ealcite  is  a  coherent 
displacive  transformation  which  has  been  investigated  extensively  under  stress-wave 
loading  [31,32].  The  transformation  initiates  and  proceeds  within  the  elastic  range 
of  the  material  and,  in  polycrystaliine  specimens,  is  sensitive  to  both  the  shear  stress 
state  and  the  microstructure  of  the  body.  Phase  change  through  the  stress  wave  in¬ 
volves  a  transformation  shape  change  as  well  as  volume  change,  and  the  process  leads 
to  highly  nonlinear  stress-wave  response,  including  wave  splitting  and  rarefaction 
shocks.  Accurate  characterization  of  the  stress  wave  response  requires  the  inclusion 
of  microstructural  parameters  to  account  for  local  stress  heterogeneities  which  affect 
the  range  and  shear  sensitivity  of  the  coherent  transformation. 


Perhaps  the  most  significant  shock-wave  phenomena  for  which  a  plausible  un¬ 
derstanding  has  emerged  within  the  context  of  a  model  of  heterogeneous  deforma¬ 
tion  and  accompanying  adiabatic  shear  and  temperature  trapping  is  the  4-to-6  fold 
coordination  quartz-to-stishovite  reconstructive  phase  transformation  which  occurs 
in  crystalline  Si02  [33,34].  Equivalent  shock-induced  phase  transformation  effects 
have  been  observed  in  a  number  of  silicate  minerals  as  well  as  other  materials  [35], 
however,  the  behavior  of  quartz  is  representative  and  has  historic  interest.  This 
thermally-activated  transformation  requires  minutes  to  hours  to  complete  under 
static  high  pressure  but  is  completed  within  a  few  ns  under  shock  compression. 


Shock-wave  studies  also  note  an  anomalous  metastable  Hugoniot  [35,36]  re¬ 
sponse  through  the  quartz-stishovite  mixed  phase  region,  which  relates  a  fixed  mass 
fraction  of  transformed  material  at  a  particularly  Hugoniot  pressure.  Shock  pres¬ 
sures  in  excess  of  20  GPa  over  the  initiation  transformation  pressure  are  necessary 
for  complete  transformation  tc  the  stishovite  phase.  In  addition  to  the  unusual 
Hugoniot  behavior,  release  wave  studies  from  shock  pressures  indicate  fluid  or  fluid¬ 
like  response  at  the  Hugoniot  state  [34],  The  reverse  transformation  during  shock 
unloading  reveals  large  hysteresis  in  the  complete  shock  transformation  cycle.  Shock 
recovery  experiments  uncover  complex  deformation  fabric  through  optical  and  TEM 


microscopy.  Traces  of  stishovite  and  significant  quantities  of  high-density  glass  are 
seen  in  the  recovered  samples  [37,38]. 

These  and  other  curious  shock  effects  associated  with  the  phase  change  in 
quartz  as  well  as  other  materials  are  readily  understood  within  the  context  of  a 
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heterogeneous  deformation  and  accompanying  adiabatic  shear  and  thermal  trap¬ 
ping  effect.  Calculations  show  that  temperatures  associated  with  localized  adiabatic 
shearing  are  adequate  to  accommodate  reconstructive  transformation  through  ther¬ 
mally-activated  rate  processes  within  the  risetime  of  the  shock  wave,  and  thermal 
quenching  after  passage  of  the  shock  wave  accounts  for  the  mass  fraction  trans¬ 
formed  on  the  metastable  Hugoniot  [21,23].  Also,  local  temperatures  and  thermal 
conduction  rates  are  consistent  with  the  persistence  of  laminar  melt  domains  at  the 
shock  state  and,  therefore,  the  fluid-like  release  wave  behavior.  High-density  glass 
in  recovered  shocked  minerals,  as  well  as  minerals  recovered  from  impact  meteor 
craters,  have  been  explained  within  the  context  of  a  thermal  heterogeneous  shock 
deformation  process  [39]. 

VTT.  Summary.  The  present  report  reviews  studies  focused  on  understand¬ 
ing  and  modeling  large-amplitude,  nonlinear  stress-wave  propagation  in  solids.  Re¬ 
cently  developed  time-resolved  measuring  techniques  are  providing  constraining 
data  in  terms  of  the  structure  and  evolution  of  stress  and  particle  velocity  profiles. 
The  data  indicates  that  microstructural  effects  are  fundamental  to  the  stress-wave 
propagation  phenomena.  Constitutive  modeling  of  the  dynamic  deformation  process, 
with  explicit  treatment  of  both  the  existing  and  evolving  microstructure,  is  needed 
to  calculate  complex  stress-wave  propagation.  More  specifically,  only  through  micro- 
structural  considerations  will  important  shock-wave  effects  involving  unique  physi¬ 
cal,  chemical,  and  metallurgical  processes  be  understood  and  exploited. 
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EXAMPLES  AND  SIGNIFICANCE  OF  CHANGE  OF  TYPE 
IN  VISCOELASTICITY 

Daniel  D.  Joseph^  \  Michael  Renardy^1 2*  and  Jean-Claude  Saut^ 

ABSTRACT.  The  equations  governing  the  flow  of  viscoelastic  fluids  are 
classified  according  to  the  symbol  of  their  differential  operators. 

Conditions  for  a  change  of  type  in  steady  two-dimensional  flows  are  derived 
for  a  three-constant  Oldroyd  model.  We  find  a  change  of  type  in  the  vorticity 
equation  when  a  critical  condition  involving  speeds  and  stresses  is  satisfied. 

We  also  sketch  how  change  of  type  can  be  discussed  for  more  general  models. 

I.  INTRODUCTION.  An  important  dimensionless  quantity  characterising  the 
flow  of  viscoelastic  fluids  is  the  Weissenberg  or  Deborah  number.  The  exact 
definition  of  this  quantity  varies  with  the  constitutive  model  and  the  flow 
tinder  consideration,  but,  roughly  speaking,  it  measures  the  ratio  of  elastic 
to  viscous  forces,  or,  alternatively,  of  a  time  characteristic  of  the  fluid  to 
a  time  characteristic  of  the  flow. 

Numerical  calculations  of  steady  flows  i.n  viscoelastic  fluids  typically 
fail  if  this  Weissenberg  number  is  high  or  even  moderate.  It  is  not  well 
understood  why  and  the  reason  is  probably  not  always  the  same.  Experimentally, 
qualitative  changes  in  the  flow  behaviour  are  often  observed  at  high  Weissenberg 
numbers . 

In  a  recent  paper  [b] ,  we  advance  the  idea  that  some  of  these  effects  are 
related  to  a  change  of  type  in  the  governing  equations.  We  discuss  change  of 
type  in  detail  for  a  three-constant  Oldroyd  model,  but  also  sketch  an  analysis 
for  more  general  models.  This  study  extends  earlier  work  of  Rutkevich  [10], 

Ultrnan  and  Denn  [11],  and  Luskin  [7 ] .  When  discussing  change  of  type  ve  have 
to  distinguish  between  two  cases: 

1.  There  is  a  change  of  type  for  the  equations  governing  steady  flow  as  well 
as  for  the  time-dependent  equations.  This  leads  to  Hadamard  instability 
and  ill-posedness  of  the  initial  value  problem.  This  kind  of  situation  is 
familiar  from  the  theory  of  phase  transitions. 

2.  There  is  a  change  of  type  in  the  steady  equations,  but  not  in  the  unsteady 
equations.  This  happe  when  the  speed  of  the  fluid  exceeds  a  wave  propa¬ 
gation  speed  as  in  a  sonic  transition  in  gas  dynamics.  There  is  no 
Hadamard  instability  associated  with  this. 
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Several  papers  in  the  literature  have  attempted  to  link  experimental 
observations  to  change  of  type.  Hunter  and  Slemrod  [4] ,  and,  on  the  basis  of 
a  different  model,  Becker  and  his  ceworxers  [2]  have  tried  to  explain  melt 
fracture  by  a  change  of  type  leading  to  Hadatnard  instability  (see  [1]  for  a 
detailed  and  critical  discussion  of  Becker’s  theory).  Ultman  and  Denn  (11) 
refer  to  an  observation  of  James  [5]  on  heat  transfer  in  flows  past  a 
cylinder.  It  appears  that  there  is  a  discontinuity  in  slope  when  heat 
transfer  coefficient  is  plotted  against  the  speed  of  the  fluid.  Ultman  and 
Denn  suggest  that  a  sonic  transition  occurs  at  the  speed  where  the  slope  is 
discontinuous.  Recently,  Yoo,  Ahrens  and  Joseph  (12)  have  discussed  experi¬ 
ments  by  Metzner,  Uebler  and  Fong  (8)  on  tube  entry  flows  from  a  conical 
region.  At  high  Weissenberg  number,  the  flow  partitions  into  an  interior 
cone,  where  the  streamlines  are  approximately  straight  towards  the  sink,  and 
an  outer  region  of  recirculation.  The  boundary  between  these  regions  seems  to 
be  rather  sharp,  and  there  is  an  apparent  discontinuity  in  the  vorticity  (see 
Fig.  11  in  (8J  ) .  Yoo,  Ahrens  and  Joseph  relate  this  observation  to  our 
analysis  of  Oldroyd  models.  All  these  studies  are  rather  tentative,  and  at 
present  not  enough  is  known  either  experimentally  or  theoretically  to  make 
strong  claims. 

In  section  2,  we  give  basic  definitions  relating  to  change  of  type  in 
first  order  systems  of  partial  differential  equations.  These  are  applied  in 
section  3  to  the  study  of  two-dimensional  steady  flows  for  a  class  of  three- 
constant  Oldroyd  models  (9).  A  criterion  for  criticality  is  given,  and  the 
vorticity  is  identified  as  the  variable  associated  with  the  change  of  type. 

In  section  4  we  demonstrate  how  similar  ideas  can  be  extended  to  general 
fluids  with  fading  memory.  However,  it  is  in  general  not  possible  to  decouple 
the  characteristic  equation  and  isolate  a  vorticity  equation  as  in  the  case  of 
the  three-constant  Oldroyd  model.  • 

2..  BASIC  DEFINITIONS .  The  equations  for  viscoelastic  flow  discussed 
below  have  the  form  of  quasilinear  first  order  systems.  In  this  section,  we 
give  some  definitions  relating  to  characteristics  and  change  of  type  in  such 
systems  (see  e.g.  (3) ).  We  are  concerned  with  equations  of  the  form 

n  3_u 

(2.1)  \  A^(x,u)  ~  f(x,u) 

1*0  1 

where  _u  is  a  k-vector  and  the  A^  are  k  x  jc-matrices.  The  term  "quasi- 

linear"  means  that  A.  and  f  may  depend  on  x  and  u,  but  not  on 

*x  —  —  — ~ 

derivatives  of  _u,  i.e.  the  highest  order  derivatives  occur  in  the  equations 
.  in  .a  linear  way.  Fcr  every  choice  of  x_  an<3  u_*  we  a  fine  characteristic 
surfaces  as  follows;  A  surface  givan  by  an  equation  y(t,Xj, . . . ,xr)  *  0  is 
characteristic  if 

(2.2)  Ac tU  A,  f*~)  *  0  . 

1*0  1 

The  system  is  called  elliptic  if  there  are  no  real  characteristic  surfaces. 
Hyperbolic  systems  are  characterized  as  the  opposite  extreme,  namely,  there  is 
a  maximal  number  of  real  characteristics.  More  precisely,  a  system  is  called 
hyperbolic,  if  one  of  the  matrices  A  ■  ^  is  non-singular  and,  for  every 

choice  of  real  parameters  (X^,  1  *  0,1,..., n>  1/  y),  the  roots  a  of  the 
eigenvalue  problem 
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are  real  and  semisimple.  The  equations  of  viscoelastiticity  are  neither 
elliptic  nor  hyperbolic.  However,  we  will  encounter  situations  where  the 
number  of  real  characteristic  surfaces  changes.  In  this  case,  we  say  there  is 
a  change  of  type. 

The  phenomenon  of  Hadamard  instability  is  closely  related  to  this.  It  is 
evident  that,  if  (2.3)  has  complex  roots,  then  Im  (a)  can  be  made  arbitrari¬ 
ly  large  by  making  the  X^  large.  If  we  choose  p  «  0  and  interpret  the 

first  coordinate  xfl  *=  t  as  time,  then  this  means  that  the  linearization  of 

(2.1)  will  have  rapidly  growing  solutions  when  the  initial  data  are  very 
oscillatory.  This  kind  of  catastrophic  instability  is  referred  to  as 
"Hadamard  instability". 


3.  CHANGE  OF  TYPE  IN  TWO-DIMENSIONAL  STEADY  FLOWS  OF  THREE-CONSTRNT 
OLDROYD  FLUID.  We  consider  differential  models  with  a  constitutive  law  of  the 


form 

(3.1) 


Dt 

X  —  +  T 
Dt  » 


2n  d 


where  D/Dt  denotes  a  frame  invariant  time  derivative  expressed  as 
Dt  3t 

(3.2)  *  —  +  (u-V)T  +  t  ft  -  fl  T  -  a(T  D  +  D  T)  . 

Dt  3t  —  «■«««•  m  m  m  m 

Here  wa  have  split  the  velocity  gradient  7  u  with  components  (V  u)^  « 

3ui/3x,  into  its  symmetric  part  Q  -  1/2  (7  u  +  (7  u)T)  and  its  anti- 

symmetric  part  ft  *  1/2  (7  u  -  (7  u)  ).  The  special  cases  a  *>  1,  a  *  -1  and 
a  »  0  are  known  as  the  upper  convected,  lower  convected  and  corotational 
Maxwell  model,  respectively. 

In  steady  two-dimensional  flows,  we  denote  velocity  components  by  u 
and  v,  and  the  extra  stress  tensor  is  written  in  the  form 

c  T 

(3.3)  J  »  t  )  . 

T  Y 

The  constitutive  law  (3.1),  together  with  the  equati  on  of  mu Lion  and  the 
incompressibility  condition  leads  to  the  following  quasilinear  first  order 
system 

uo  +  vo  +  T ( v  -u  )  -  a(2ou  +x(u  +v  )]  -  3  •?  u  ®  -  T 
x  y  x  y  x  y  x  Xx  X 


(3.4) 


UT  +  VT  +~  (0-Y)(U  -V  )  -  f  (O+YXu  +V  )  -  -7  (U  +V  ) 
x  y  2  y  x  2  yx  A  y  x 

UY  +  VY  +  T  (u  -v  )  -  a[2Yv  +t(u  +v  )]  -  2  -r  v  *-4 
x  y  y  x  y  y  x  X  y  X 

p(uv  +vu  )  +  p  -0  -T  *0 
x  y  x  x  y 

p(uv  +VV  )+p  —  T  — Y  *0 

x  y  y  x  y 
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Vie  can  apply  the  definitions  of  section  2  to  this  system.  This  leads  to  the 
following  equation  for  the  slope  a  *  dy/dx  of  characteristic  lines. 

(3.5)  ( 1  +a2 )  ( -au+v ) 2  {p  { -au+v ) 2  +  (2~)(a2-1)  +  2ta  -  (a2+«*(2-  +  a(2±2.) )}  -  0  . 

We  see  that  the  stream  lines  are  double  characteristics#  a:  at  two  charac¬ 

teristic  values  are  always  complex.  The  interesting  factor  ..s  the  last  one. 
The  roots  of  this  factor  change  from  -  'mplex  to  real  when  the  sign  of 

(3.6) [pu2  +  2.  ( 1-a)  -  |  (1+a)  -  2.]  t(Wa)  X  +  (a-1)  |  +  J  -  pv2]  +  (puv-t)2 
changes  from  negative  to  positive. 

The  reason  why  (3.5)  decouples  into  quadratic  factors  becomes  evident  in 
a  streamfunction-vorticity  formulation.  When  the  equations  are  rewritten  in 
this  way,  one  can  see  that  the  roots  o  =  ±i  are  associated  with  the  equation 
expressing  the  vorticity  as  the  Laplaci'an  of  the  stream  function.  The  third 
factor  is  associated  with  an  equation  which  involves  a  linear  combination  of 
second  derivatives  of  the  vorticity  and  only  contains  lower  order  terms  other¬ 
wise.  It  is  therefore  the  vorticity  which  is  associated  with  the  change  of 
type.  It  is  interesting  in  this  context  that  the  experiments  of  Metzner# 
Uebler  and  Fong  [8]  can  be  interpreted  as  suggesting  a  discontinuity  in  the 
vorticity. 

One  can  also  derive  a  time  dependent  vorticity  equation#  which  leads  to  a 
criterion  for  Hadaznard  instability.  Hadamard  instability  occurs  if  one  of  the 
following  conditions  is  violated 

(3.7)  X2t2  -  (n  -  X(X  (i-a)  -  |  (1+a))] In  -  X  (|  (1-a)  -  X  (1+a))]  <  0 

(3.8)  X[X  (1-a)  -  |  (1+a)]  -  n  <  0  . 

Note  that  (3.7)  agrees  exactly  with  (3.6)  for  zero  speeds.  Changes  of  type  in 
steady  flow  which  do  not  involve  Hadamrd  instability  must  therefore  require  a 
non-zero  speed  of  the  fluid.  In  fact#  the  criterion  is  that  the  speed  of  the 
fluid  is  faster  than  a  viscoelastic  wave  speed.  In  particular#  if  the 
stresses  vanish,  a  change  of  type  occurs  when  the  fluid  speed  exceeds  the  wave 
speed  of  linear  viscoelasticity.  Since  this  requires  a  finite  (but  not  large) 
Reynolds  number#  such  changes  of  type  are  more  likely  to  be  found  in  dilute 
polymer  solutions  rather  than  in  melts. 

In  discussing  the  criteria  (3.6)  or  (3.7),  (3.8) ?  it  must  be  kept  in  mind 
that  the  values  of  the  extra  stresses  are  not  arbitrary.  The  constitutive  law 
(3.1>  can  be  regarded  as  an  evolution  problem  for  the  stress  with  given  defor¬ 
mation.  However,  in  the  discussion  of  materials  with  fading  memory#  we  are 
not  interested  in  arbitrary  solutions  of  this  evolution  problem,  but  only  in 
those  that  behave  reasonably  as  time  tends  to  This  imposes  restrictions 

on  the  values  of  the  extra  stresses#  which  can  be  shown  to  preclude  Hadamard 
instability  if  a  *  ±1. 

For  a  discussion  of  particular  flow  geometries  we  refer  to  [6]  ard  [12]. 


4.  CHANGE  OF  TYPE  IN  FLUIDS  WITH  FADING  MEMORY.  The  extra  stress  T  in 
a  simple  fluid  is  given  by  an  isotropic  functional  of  the  history  of  tne 

T 

relative  Cauchy  strain  G\s)  **  (t-s>F^(t-s)  -  1,  i.e. 

=  =t  =t  * 

(4.1)  T«FtG(s))*=0  * 

By  taking  the  material  derivative  of  i4.1),  ve  obtain 

dt  .  dG 

<4‘2>  dT'Il^ldi3  * 

Following  Coleman  and  Noll,  we  assume  that  the  Frechet  derivative  F-j  of  the 
functional  g  can  be  represented  in  the  form 


(4.3) 


dG  dG(s) 

!,!§  I  aF1  ■  l~o  £ls'2>  -4:—  as 


dt 


Here  K(s,S)  is  a  fourth  order  tensor  depending  on  s  and  the  values 

{G(0),~O  <  o  <<*’}.  For  the  following,  we  assume  that  K  and  its  first 
derivative  with  respect  to  s  are  integrable. 

The  material  derivative  of  Q  is  given  by 

dG  dG 

(4.4)  ^--LG-GL-  — 

where  L  «  V  u  is  t^e  present  value  of  the  velocity  gradient.  Hence  we  find 


dG, 


If  £  *w 

‘••S’  —  !slas  =  -  VKi3k*  +  KiiJk,Vls)ds  •  V*’ 


h  Kijki'~'='  dt 


(4.5) 


dG(s) 


K4,ve(s,G) 


J0  ‘'ijkf ds 


ds 


The  last  term  can  be  integrated  by  parts  and  treated  as  a  perturbation  of 
lower  differential  order.  With 


(4.6) 


Md 


■  -  VKiikI  +  Ki1ik,Gpt(S,dS 


i3kp  -  -  'o”'ijkl  ■  "ijik-pt' 

we  can  therefore  write  the  equations  of  viscoelastic  fluid  motion  in  the  form 

9u 


dtij 

dt 

du. 


(4.7) 


dt 


Mijkp  3xr  +  Nij 

E  -  !P_+!lii+  f 

3x.  3x .  xi 
i  3 


3u. 


3x^  =  0 

This  again  has  the  form  of  a  quasilinear  first  order  system,  and  the 
definitions  of  characteristics  and  change  of  type  apply.  In  general,  however, 
it  is  not  possible  to  decouple  this  system  as  in  section  3  and  isolate  a 
vorticlty  equation.  In  two-dimensional  steady  flow,  wa  would  still  find  the 
stream  lines  as  double  characteristics,  but  the  remaining  characteristic 
values  would  be  determined  by  a  fourt.h  order  equation,  which  cannot  easily  be 


factored.  In  (61 ,  we  identify  a  class  of  constitutive  models  which  has 
certain  structural  similarities  with  the  Oldroyd  models  above  and  permits  the 
derivation  of  a  vorticity  equation* 
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ABSTRACT.  A  new  method  has  been  developed  for  solving  the  partially 
plastic  problems  of  thick-walled  cylinders  made  cf  strain-hardening  or 
ideally-plastic  materials  subjected  to  any  combination  of  internal  pressure, 
external  pressure,  and  end  loads.  The  incremental  strains  are  chosen  as  the 
basic  unknowns  in  the  finite-difference  formulation.  The  incremental  sizes  of 
the  applied  loading  are  determined  automatically  and  no  iteration  is  needed. 
Complete  solutions  for  the  stresses,  strains,  and  displacement  base  been 
obtained  and  all  numerical  results  are  very  accurate.  This  approach  Is  also 
efficient  and  simple,  yet  quite  general,  when  compared  with  many  solutions  in 
the  literature. 

I.  INTRODUCTION.  The  partially  plastic  problem  of  pressurized  thick- 
walled  cylinder  is  of  practical  importance  to  pressure  vessels  and  the 
autofrettage  process  of  gun  barrels.  Many  solutions  for  this  problem  have 
been  reported  [1-7] .  For  thick  tubes  under  very  high  pressure  operation,  the 
elastic-plastic  material  model  should  be  represented  by  the  von  Mises'  yield 
criterion,  Prandtl-Reuss’  incremental  stress-strain  laws,  the  strain¬ 
hardening,  and  compressibility  [8] .  However,  a  closed-form  solution  exists 
only  in  the  plane-strain  case  neglecting  strain-hardening  and  compressibility. 

For  the  generalized  plane-3train  problems  considered  here,  numerical 
solutions  were  reported  by  the  finite-difference  method  [4,7]  and  finite- 
element  method  [5] .  The  incremental  displacements  were  used  as  the  basic 
unknowns  and  a  displacement  function  was  assumed  in  the  finite-element  method 
[5] .  The  incremental  stresses  and  strains  were  used  in  [4]  as  the  basic 
unknowns,  but  only  the  incremental  strains  were  used  in  [7].  The  spatial 
discretization  used  in  [4,7]  was  based  on  the  forward  difference  scheme  and  a 
fixed  sequence  of  incremental  loading  was  used. 

In  the  present  paper,  a  new  method  is  developed  and  more  accurate 
numerical  results  are  obtained.  The  incremental  strains  are  chosen  as  the 
basic  ynk uowas  in  the  finite-difference  formulation.  Both  strain-hardening 
and  ideally-plastic  materials  can  be  considered.  The  spatial  discretization 
are  based  on  the  central  difference  scheme  and  the  incremental  -  ies  of  the 
applied  loading  are  determined  automatically  in  the  program.  Th  < ..cremental 
results  are  calculated  directly  and  no  iteration  is  needed.  The  divergence 
oF  the  approach  will  be  discussed  and  more  accurate  results  will  be  reported. 
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II 4  BASIC  EQUATIONS.  Assuming  small  strain  and  no  body  forces  in  the 
axisymmetric  state  of  generalized  plane-strain,  the  radial  and  tangential 
stresses,  or  and  09,  must  satisfy  the  equilibrium  equr  '•on, 

r(3ar/3r)  *  09  -  ar  ;  (1) 

and  the  corresponding  strains,  er  and  eg,  are  given  in  terms  of  the  radial 
displacement,  u,  by 

er  *  3u/3r  ,  eg  =*  u/r  (2) 

It  follows  that  the  strains  mu3t  satisfy  the  equation  of  compatibility 

r( 3eg/3r)  »  er  -  eg  (3) 

If  the  material  is  assumed  to  be  elastic-plastic,  obeying  the  Mises'  yield 
criterion,  the  Prandt.l-Reuss  flow  theory,  and  the  isotropic  hardening  law,  the 
stress-strain  relations  are  [1]: 

dej/  -  doi’/2G  +  (3/2)0^0/(011)  (4) 

do  >  0  for  i  •*  r,  0,z 

de*  *  ET^l^vOdOa  (5) 

where  E,  v  are  Young's  modulus,  Poisson's  ratio,  respectively, 

2G  -  E/(l+u), 

em  *  (er+egfez)/3  ,  ej*  *  ej  -  % 

-  (or+a&toz)/3  ,  oi»  -  Oi  -  o,, 

o  «  (l//2)[(or-og)2  +  (og-o2)2  +  (oz-or)2]1/2  >  aQ  (6) 

and  o0  is  the  yield  stress  in  simple  tension  or  compression  For  a  strain¬ 
hardening  material,  H'  is  the  slope  of  the  effective  stress/plastic  rtrain 
curve 

0  -  H(JdcP)  (7) 

For  an  ideally-plastic  material  (H*  *0),  the  quantity  (3/2)do/(oH')  is 
replaced  by  dX,  a  positive  factor  of  proportionality.  When  o  <  o0  or  do  >  0, 
the  state  of  stress  is  elastic  and  the  second  term  in  Eq.  (4)  disappears. 
Following  Yamada  et  al  [9],  Eqs.  (4)  and  (5)  can  be  rewritten  in  an 
incremental  form 

doi  *  dj4d£4  for  i,j  *  r,0,z 

(8) 

and 

dfj  ■  tv/(l-2v)  +  -  Oj['aj’/S] 
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where 


S 


2  1  , 
-  (1  +  -  H’/G)o2 
3  3 


H‘/E  «  u)/(l-u) 


(9) 


u>E  is  the  slope  of  the  effective  stress-strain  curve,  and  is  the  Kronecker 
delta. 


This  form  was  used  in  the  finite-element  formulation  for  solving  elastic- 
plastic  thick-walled  tube  problems  [5].  In  the  following  section,  the 
incremental  stress-strain  matrix  will  be  used  in  the  finite-difference 
formulation. 


III.  FINITE-DIFFERENCE  FORMULATION.  Consider  a  thick-walled  cylinder  of 
inner  radius  a  and  external  radius  b.  The  tube  is  subjected  to  inner 
pressure  p,  external  pressure  q,  and  end  force  f.  The  elastic  solution  for 
this  problem  is  well-known  and  the  pressure  p*,  q*,  or  f*  required  to  cause 
initial  yielding  can  be  determined  by  us-ing  the  Mises*  yield  criterion.  For 
loading  beyond  the  elastic  limit,  an  incremental  approach  of  the  finite- 
difference  formulation  is  used.  The  cross-section  of  the  tube  is  divided  into 
n  rings  with  fi=a,r2, . . .  .r^P.  •  •  •  »rn+l“b  where  p  is  the  radius  of  the  elastic- 
plastic  interface.  At  the  beginning  of  each  increment  of  loading,  the 
distribution  of  displacements,  strains,  and  stresses  are  assumed  to  be  known 
and  we  want  to  determine  Au,  Aer,  Ae@,  Aez,  Aor,  Aoq,  Aoz  at  all  grid  points. 
Since  the  incremental  stresses  are  related  to  the  incremental  strains  by  the 
incremental  form  (Eq.  (8))  and  Au  *  rAeg,  there  exists  only  three  unknowns  at 
each  station  that  have  to  be  determined  for  each  increment  of  loading. 
Accounting  for  the  fact  that  the  axial  strain  ez  is  independent  of  r,  the 
unknown  variables  in  the  present  formulation  are  (Aee)^,  (Ae^^,  for  i  » 

1,2, . . .  ,n,n+l ,  and  Aez. 


The  equation  of  equilibrium  (1)  and  the  equation  of  compatibility  (3)  are 
valid  for  both  the  elastic  and  the  plastic  regions  of  a  thick-walled  tube.  A 
first-order-correct  finite-difference  analog  of  these  two  equations  at  i  * 
l,...,n  has  been  given  in  [4,71.  Other  finite-difference  forms  can  be 
written.  In  the  present  paper,  the  difference  equations  given  below  are 
second-order-correct.  The  equation  of  compatibility  (3)  and  equation  of 
equilibrium  (1)  are  replaced,  respectively,  by 

cii(Aee)i  +  C2i(Aer)i  +  c3i(Aeg)i+1  +  C4i(  Aer)i+1  =  csi  (10) 

and 

cli(Aor)i  +  C2i(Aoe>i  +  c4i(Aar)i+i  +  C4i(Aae)i+i  *  C6i  (11) 


where 


cli  -  -  -  +  j  Ti  ,  c2i  =  -  -  -  Yi 


°3i  =  2  "  2  *  C4i  “  “  2  +  2  Yl 


c5i  =  “  cli<£e)i  -  c2i(er>i  “  C3i(ee)i+1  -  C4i(er)i+1 


and 


c6i  *  “  cli(ar>i  ”  c2iX°0)i  “  c8i(°r>i+l  “  c4i(°8)i+l 


-  ri+l/ri  (12) 

With  the  aid  of  the  incremental  stress-strain  relations  (8),  Eq.  (11)  can 
be  written  as 

c7i(Ae0)i  +  C8i(Aer)i  +  C9i(Ae0)i+l  +  ciOi(Aer)i+l  +  ciii(Aez)  «  C6i  (13) 
where 

c7i  -  cu(di2)i  +  C2i(d22)i  »  =8i  "  cii(dll>i  +  C2i(d2l)i 

c9i  *  C3i(dl2>i+1  ■*  c“4i(d22)i+l  ,  ciot  *  C3i(dll)i+1  +  C4i(d22)i+1 

clli  *  cli<d13)i  +  C2i(d23)i  +  C3i(di3)i+i  +  C4i(d23)i+1  (1*) 

The  boundary  conditions  for  the  problem  are 

Aor(a,t)  *  -Ap  ,  Aor(b,t)  *  -Aq 
n 

*  I  [ti(Aoz)i  +  ri+i(Aoz)i+i](ri+i-ri)  -  U*a2Ap  +  Af  (15) 

i-1 

where  u  is  0  for  open-end  tubes,  and  1  for  closed-end  tubes.  Using  the 
incremental  relations  (8),  we  rewrite  Eq.  (15)  as 

(di2)i(Aee)i  +  (du)i  (Ae^i  +  (di3>iAez  »  -Ap  (16) 

(d12)tri-l(Ae9)n4-l  +  (dll>n+l(Aer)ttfl  +  (d13>rri-lAez  3  (i7> 

and 

n 

I  lc12i(Ae6)i  +  c13i(Aer)i  +  c14l( Ae9)i+1  +  cis^Ae,.)^ 

1*1 

+  c16i(Aez)3  *  Va2Ap  +  Af/n  (18) 

where 

c12i  “  (iri+l“ri)ri(d32)i  ,  c!3i  -  (ri+l-*i)ti(d3l)i 

cl4i  -  (ri+i-ri)ri+i(d32)i+l  ,  ci51  -  (ri+i-ri)ri+1(d3i)i+i 

c16i  ■  (ri+l“ri)tri(d33)i  +  ri+l(d33>i4-l]  (19) 

Now  we  can  form  a  system  of  2»i-3  equations  for  solving  2n+3  unknowns,  ( Ae0)^, 
(Aer)i*  at  i  ■  1,2, . . .  ,n,ttH  and  Aez.  Equations  (16),  (17),  and  (18)  are 
taken  as  the  first  and  last  two  equations,  respectively,  and  the  other  2n 
equations  are  set  up  at  i  *  l,2,...,n  using  Eqs.  (10)  and  (13).  The  final 
system  is  an  unsyaaetric  matrix  of  arrow  type  with  the  nonzero  terms  appearing 
in  the  last  row  and  column  and  others  clustering  about  the  main  diagonal,  two 


below  and  two  above.  In  the  compute/:  program  which  was  developed,  the 
dimensionless  quantities  r/a,  Ser/o0,  E ee/a0,  Eez/o0,  or/o0,  oq/o0,  az/a0, 
p/Oq»  q/cr0,  f/(ra2o0)  were  used  in  the  formulation  and  the  Gaussin  elimination 
method  was  used  to  solve  these  equations.  All  calculations  were  carried  out 
on  IBM  4341  with  double  precision  to  reduce  round-off  errors. 

IV.  OPTIMAL  INCREMENTAL  LOADING.  Given  zny  combination  of  incremental- 
loading  (Ap,  Aq,  or  Af ) ,  we  can  now  determine  all  incremental  results 
(displacements,  strains,  and  stresses)  directly.  No  iteration  is  needed, 
while  in  [4],  many  iterations  in  each  step  were  required  because  a  value  for 
Aez  was  assumed.  The  sizes  of  incremental-loading  should  be  chosen  properly 
in  order  to  obtain  accurate  results  at  a  reasonable  cost.  When  the  total 
applied  pressure  p  is  given,  it  is  natural  to  divide  the  loading  path  in  m 
equal  fixed  increments  such  as  Ap  »  (p-p*)/m.  Larger  values  of  m  give  more 
accurate  results.  '•  sequence  of  decreasing  load-increments  is  a  better  choice 
than  that  of  equal  increments.  In  order  to  increase  the  efficiency  without 
effecting  the  accuracy,  an  adaptive  algorithm  has  been  implemented  on  the 
basis  of  a  scaled  incremental-loading  approach  [5] . 

In  each  step,  a  dummy  load-increment  such  as  Ap  is  applied  and  the 
incremental  results  Ao^  for  i  *  r,0,z  at  all  grids  are  determined.  For  all 
grid  points  at  which  a  =■  |lo^||  <  a,  we  compute  the  scaler  a’s  by  the  fonaula 

«  .  I  {r  +  [T2  +  4 1  |  Aojl  1 2(o&2-|  |  Of  |  1 2)]*-/2}|  |  Ao$J  | 2  (20) 

where 

r-  ll°i!l2+  ll*°ill2-  IKII2  <a> 

»“•  I KII,  il4<,ill.  Il°i  +  4»ill  are  computed  by 

I  kil  !2  -  -  K<v°e>2  +  (°e-oz)2  +  ( az“°r)2]  (22) 

Let  X  be  the  minimum  of  the  a’s.  Then  X  is  the  load-increment  factor  just 
sufficient  to  yield  one  additional  point.  A  sequence  of  X(j)  can  be 
determined  for  all  steps  j  *  l,2,...ir  and  the  updated  results  are 

p(j)  »  p(j”l)  +  X(j)Ap(j) 

a±(.i)  -  OiU”1)  +  X( j) Ao^C j)  ,  etc.  (23) 

This  sequence  of  incremental  loading  is  optimal  for  the  present  problem 
because  all  the  coefficients  c’s  in  Eqs.  (12),  (14),  and  (19)  are  functions  of 
the  previous  stresses  and  strains. 

V.  CONVERGENCE  STUDY.  In  order  to  demonstrate  the  accuracy  of  the 
approach,  four  convergence  studies  are  made.  Consider  a  thick-walled  tube  of 
wait  ratio  b/a  -  2  and  subjected  to  internal  pressure  only.  The 
cross-section  of  the  tube  is  divided  into  n  rings  of  equal  thicknesses,  i.e,. 
h  »  (b-a)/n.  The  first  problem  is  a  closed- -end  tube  loaded  in  the  elastic 
range  with  G  ■  10s/3  psi,  v  «  0.3,  p  *  5  psi.  The  numerical  results  with  n  ■ 


10,  20,  50,  100  are  shown  in  Table  1  together  wicb  the  Lame'  solution  for  the 
hoop  stresses  and  strains  at  the  boundaries  a  and  b.  The  numerical  results 
are  correct  up  to  four  digits  with  n  *  100. 

TABLE  1.  ELASTIC  SOLUTION  FOR  A  CLOSED-END  TUBE 

(b/a  -  2,  G  *  105/3  psi,  "  -  0.3,  p  *  5  psi) 


Eee/a 


Eee/b 


8.3333 

3.3333 

9.3333 

2.8333 

8.3000 

8.3256 

8.3320 

8.3330 

3.3000 

3.3250 

3.3320 

3.3330 

9.3000 

9.3250 

9.3320 

9.3330 

2.8000 

2.8250 

2.8320 

2.8330 

The  second  problem  is  the  initial  yielding  solution  for  a  plane-strain 
tube  with  E/o0  *  200,  v  «  0.3,  ez  *  0.  The  numerical  results  with  n  *  10,  20, 
50,  100,  200,  are  shown  in  Table  2  together  with  the  exact  solution  for  the 
dimensionless  p  *  p/60,  oq  «  og/o0,  eq  =*  (E/oo)(E0)  at  r  *  a  and  b. 

TABLE  2.  INITIAL  YIELDING  SOLUTION  FOR  A  PLANE-STRAIN  TUBE 

(b/a  -  2,  E/c0  -  200,  v  -  0.3) 


ee/b 


.26226 


The  third  problem  is  the  elastic-perfectly  plastic  solution  for  a  plane- 
strain  tube  with  b/a  *  2,  E/o0  *  200,  v  *  0.3,  w  »  0,  ez  »  0.  The  numerical 
results  with  n  *  10,  20,  50,  100,  200  are  shown  in  Table  3  for  the  pressure 
and  displacement  at  the  bore  corresponding  to  50  percent  and  100  percent 
overstrain,  i.e.,  p/a  »  1.5  and  2.0. 
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TABLE  3.  ELA5TIC-PERFECTLY  PLASTIC  SOLUTION  FOR  A  PLANE-STRAIN  TUBE 

(b/a  -  2,  E/aft  =  200,  v  -  0.3) 


50%  Overstrain 


100%  Overstrain 


.71808 

.71803 

.71801 


1.97725 

1.96969 

1.96780 

1.96759 

1.96757 


9849 
9790 
9760 
.79751 
.79747 


3.68937 

3.66831 

3.66280 

3.66211 

3.66199 


The  numerical  results  converge  and  are  accurate  up  to  four  digits  with  n  ■ 
100.  There  is  no  closed-form  solution  available  for  comparison.  The  famous 
paper  by  Hodge  and  White  [2]  has  been  used  quite  often  as  a  basis  for 
assessing  the  accuracy  of  other  approximate  methods.  However,  the  numerical 
integration  is  cumbersome.  The  present  formulation  is  much  simpler  and  seems 
more  accurate.  In  order  to  further  demonstrate  the  accuracy  of  the  present 
approach,  a  convergence  study  for  an  incompressible,  ideally-plastic  thick 
tube  in  plane-strain  condition  has  been  made  and  compared  with  exact  solution 
[2].  The  numerical  results  for  a  nearly  incompressible  material  (v  » 

0.49999)  are  shown  in  Table  4  together  with  the  exact  solution  (v  *  1/2)  for 
the  internal  pressure  end  the  displacement  at  the  bore  corresponding  to  50 
percent  and  100  percent  overstrain. 

TABLE  4.  INCOMPRESSIBLE,  IDEALLY-PLASTIC  SOLUTION  FOR  A  PLANE-STRAIN  TUBE 

(b/a  -  2,  E/o0  -  200,  v  -  0.49999) 


50%  Overstrain 


100%  Overstrain 


10 
20 
50 
100 
Exact 


69 
77 
.72078 
.72078 
.72078 


1.95714 

1.95072 

1.94891 

1.94865 

1.94856 


3.48806 

3.47012 

3.46509 

3.46437 

3.46410 


shows  the  results  of  the  dimensionless  quantities  p/o0,  00/0 0>  °z/ ^0, 

(E/o0)er,  (E/o0)er,  (E /oo)£0,  (E/o0)e2  at  the  inside  or  elistic-plastic 
boundary  p  for  p/a  0  1.0,  1.1,  1.2,.. .,2.0  in  a  plane-strain  tube  with  strain¬ 
hardening  parameter  to  *  0.1.  Tables  6  and  7  show  the  similar  results  of  the 
dimensionless  stresses,  strains,  displacement  at  the  bore  or  elastic-plastic 
boundary  for  various  stages  of  elastic-plastic  loadings  in  an  open-end  or 
closed-end  tube,  respectively. 

TABLE  5.  ELASTIC-PLASTIC  SOLUTION  FOR  A  PLANE-STRAIN  TUBE 
(b/a  ■  2,  E/o0  ■  200,  v  -  0.3,  et/Et  *  0.1,  n  «  100,  e2  ■  0) 


p/a 

v/°n 

Inside 

oe/o(. 

oe/oJo 

Inside 

oJan 

0rJ  I  p 

L  *rla 
°o 

E  °a 

oQ  a 

1.0 

.43229 

.72049 

.72049 

.08646 

.08646 

-.67438 

.82424 

1.1 

.51296 

.66972 

.75016 

.05586 

.10453 

-.91636 

1.00141 

1.2 

.58362 

.63049 

.78249 

.02537 

.12427 

-1.17381 

1.20371 

1.3 

.64556 

.60225 

.81739 

-.00409 

.14566 

-1.44917 

1.43022 

1.4 

.69982 

.58427 

.85479 

-.03148 

.16866 

-1.73882 

1.68000 

1.5 

.74725 

.57572 

.89457 

-.05589 

.19323 

-2.04304 

1.95207 

1.6 

.78851 

.57579 

.93667 

-.07658 

.21932 

-2.36107 

2.24535 

1.7 

.82416 

.58376 

.98092 

-.09302 

.24687 

-2.69205 

2.55869 

1.8 

.85466 

.59898 

1.02718 

-.10490 

.27581 

-3.03504 

2.89081 

1.9 

.88041 

.62086 

1.07530 

-.11205 

.30606 

-3.38898 

3.24034 

2.0 

.90177 

.64885 

1.12509 

-.11447 

.33755 

-3.75274 

3.60578 
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A  REFINED  SHEAR  DEFORMATION  THEORY 
FOR  LAMINATED  ANISOTROPIC  PLATES 


J .  N .  Ready 
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Blacksburg,  Virginia  24061  USA 

ABSTRACT .  An  improved  plate  theory  that  accounts  for  the  transverse 
shear  deformation  is  presented*  The  theory  is  based  on  an  assumed 
displacement  field  in  which  the  inplane  displacements  are  expanded  in 
terms  of  the  thickness  coordinate  up  to  the  cubic  term  and  the  transverse 
deflection  is  assumed  to  be  independent  of  the  thickness  coordinate.  The 
governing  differential  equations  ot  the  theory  are  derived  from  the 
virtual  work  principle.  The  theory  eliminates  the  need  for  shear 
correction  factors  because  the  transverse  shear  stresses  are  represented 
parabolically.  A  comparison  of  the  present  numerical  results  for  bending 
with  the  exact  solutions  of  the  three-dimensional  elasticity  theory  shows 
that  the  present  theory  is  more  accurate  than  the  first  order  shear 
deformation  theory. 

I .  INTRODUCTION .  The  classical  laminate  theory  (CLT)  ,  which  is  an 
extension  of  the  classical  plate  theory  (CPT)  to  laminated  plates,  Is 
inadequate  for  laminated  plates  made  of  advanced  filamentary  compos 
materials.  This  is  because  the  effective  elastic  modulus  to  the  effective 
shear  modulus  ratios  are  very  large  for  such  laminates.  An  adequate 
description  of  the  transverse  shear  stresses,  especially  near  the  edges, 
can  be  achieved  with  the  use  of  a  shear  deformation  theory. 

The  first,  stress-based,  shear  deformation  plate  theory  is  due  to 
Reissner  [1-3] .  The  theory  is  based  on  a  linear  distribution  of  the 
inplane  normal  and  shear  stresses  through  the  thickness, 
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(h2/ 6) 


(h72) 
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mhere  (0^,0.)  an^  sre  the  normal  and  shear  stresses,  (M^,M2)  and  Mg  are 
the  associated  bending  moments  (which  are  functions  of  the  inplane 
coordinates  x  and  y)  ,  z  the  thickness  coordinate  and  h  is  the  total 
thickness  of  the  plate.  The  distribution  of  the  transverse  normal  and 
shear  stresses  (<J_,  and  o^)  is  determined  from  the  equilibrium 
equations  of  the  three-dimensional  elasticity  theory.  The  differential 
equations  and  the  boundary  conditions  of  the  theory  were  obtained  using 
Castigliano's  theorem  of  least  work. 

The  origin  of  displacement-based  theories  is  apparently  attributed  to 
Basset  [4],  who  begins  his  analysis  with  the  assumption  that  the 
displacement  components  in  a  shell  can  be  expanded  in  a  series  of  powers 
of  the  thickness  coordinate  C*  For  example,  the  displacement  component  u^ 
is  written  in  the  form  A 
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(2a) 


ui(wc>  -  "jtei’V +  z 

n»l 

where  and  are  the  curvilinear  coordinates  in  the  middle  surface  of 
the  shell,  and  '  have  the  meaning 

,n 

U1  ^1  ’^2^  53  n” ^  ^=0  *  P  s  0,1,2 .  (2b) 

dC 

Basset's  work  did  not  receive  as  much  attention  as  it  deserves*  In  1949 
NACA  technical  note,  Hildebrand,  Reissner  and  Thomas  [5]  presented  (also 
see  Hencky  f 6 ] >  a  displacement -based  shear  deformation  theory  for  shells 
(which  obviously  can  be  specialized  to  flat  plates) .  They  assumed  the 
following  displacement  field, 

ui(Wc>  =  u^i»V  =  c  VW 

n2avt2,Q)  -  vcq,^)  +  C  *yavZz) 

U3(W°  ’  (3) 

The  differential  equations  of  the  theory  are  then  derived  using  the 

principle  of  minimum  total  potential  energy.  This  gives  five  differential 

equations  in  the  five  displacement  functions,  u,  v,  w,  c|*  ,  and 

4-  *  3 

y 

The  shear  deformation  theory  based  on  the  displacement  field  In  Eq. 
(3)  for  plates  is  often  referred  to  as  the  Mindlin'e  plate  theory. 

Mindlin  [7]  presented  a  complete  theory  based  on  the  displacement  field 
(3)  taken  from  Hencky  [6] .  The  literature  review  points  out  that  the 
basic  idea  came  from  Basset  [4],  Hildebrand,  Reissner  and  Thomas  [5],  and 
Hencky  [6] .  Therefore,  by  referring  the  displacement.-based  shear 
deformation  theory  as  Mindlin's  theory  we  are.  not  giving  due  credit  to  the 
others.  We  shall  refer  to  the  shear  deformation  theory  based  on  the 
displacement  field  (3)  as  the  first-order  shear  deformation  theory. 

Higher-order,  displacement-based,  shear  deformation  theories  have 
been  Investigated  by  Nelson  and  Lorch  [8] ,  Librescu  [9]  and  Lo, 

Christensen  and  Wu  [10].  These  higher-order  theories  are  cumbersome  and 
computationally  demanding  because  with  each  additional  power  of  the 
thickness  coordinate  an  additional  dependent  unknown  is  introduced,  per 
displacement  component,  into  the  theory.  Levinson  [11]  and  Hurthy  [12] 
presented  third-order  theories  that  assume  transverse  inextensibility. 

The  nine  displacement  functions  were  reduced  to  five  by  requiring  that  the 
transverse  shear  stresses  vanish  on  the  bounding  planes  cf  the  plate. 
However,  both  authors  used  the  equilibrium  equations  cf  the  first-order 
theory  in  their  analysis.  Stating  in  other  words,  the  higher-order  terms 
of  the  displacement  field  are  accounted  only  in  the  calculation  of  the 
strains  but  not  in  the  governing  differential  equations  or  in  the  boundary 
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conditions.  Recently,  the  present  author  (see  (13,14])  corrected  these 
theories  by  deriving  the  governing  differential  equations  from  variational 
principles.  In  this  paper  we  review  the  results  from  [13,14]  and  present 
the  variational  formulation  of  the  boundary-value  problem  associated  with 
the  theory.  The  existence  and  uniqueness  questions  will  be  investigated 
elsewhere. 


II.  THE  REFINED  THEORY.  Experimental  evidence  shows  that  tue 
transverse  shear  stresses  vary  parabolically  through  the  thickness, 
especially  near  the  edges  of  a  laminate.  A  parabolic  distribution  of  the 
shear  stresses  is  provided  by  a  displacement  field  in  which  the  inplane 
displacements  vary  as  cubic  functions  of  the  thickness  coordinate.  We 
begin  with  the  displacement  field  of  the  form  (3) ,  except  that  and  U2 

are  assumed  to  be  cubic  functions  of  the  thickness  coordinate: 

2  3 

u1(x,y>z)  =  u(x,y)  +  z<i>x(x,y)  +  z  5x(x,y)  +  z  Cx(x,y) 

2  3 

u2(x,y,z)  «  v(x,y)  +  z<J<y(x,y)  +  z  5y(x,y)  +  z  Cy(x»y) 

u3(x,y,z)  -  w(x,y)  (4) 

where  the  xy-plane  is  taken  to  coincide  with  the  midplane  Q  of  the 
laminate  and  z  is  the  thickness  coordinate  (i.e.,  transverse  to  the 
laminate) .  Using  the  boundary  conditions 

a,  =  a  =*  0  ,  cr_  =*  c  “  0  at  z  *  ±  -I  (5) 

4  yz  *  5  xz  2 

we  can  eliminate  5  T  and  C  (see  Reddy  [13])  and  obtain 

x  v  c  y 

U1  ’  u+  -f  <f>2(,|’x+!l)1 
u2-v+z[+y-|(f)2»y+|^] 

u^  *  w  (6) 

It  is  not  difficult  to  see  that  (u,v,w)  are  the  displacements  of  the  point 
(x,y,0)  and  <[>  and  ^  are  the  rotations  of  transverse  normals  at  point 
(x,y,C)  aboutxthe  y  Xnd  x  axes  respectively. 

The  governing  differential  equations  of  the  theory  can  be  obtained  by 
employing  the  principle  of  vitual  .displacements  (see  Reddy  [15]) : 


where  A^j,  ®ij»  etc.,  are  the  plate  stiffnesses,  defined  by 
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and  e^,  and  k (i*l,2,4,5,6)  are  the  strain  components 
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III x  EXISTENCE  AND  UNIQUENESS  OF  SOLUTIONS.  Here  we  consider  the 
Dirichlet  problem  for  the  equations  in  (8) .  For  symmetric  cross-ply 
laminates,  the  first  two  equations  become  uncoupled  from  the  last  three 
equations  of  Eq«  (8).  icr  zero  inplane  forces  we  have  u  *  v  *  0.  Tlius, 
we  are  required  to  7.ini  w,  4»x  and  i  under  given  boundary  conditions  (say, 
clamped  plate)  and  distributed  loacrf .  In  this  part  of  the  discussion  we 
«*»all  use  the  standard  notation  of  functional  analysis  (see  Rektoryp 
[161)  • 


The  variational  problem  associated  with  the  last  three  equations  in 
(8)  with  the  zero  essential  boundary  conditions  involves  finding 
A  «  (w,^,^  £  H  such  that 

B(A,A)  -  A(A)  for  all  A  £  H  (13) 


where  H  denotes  the  product  space 


H  -  H2(Q)  x  H*(Q)  x  H*(Q)  (14) 

Here  Q  denotes  the  midplane  of  £he  plate  ^assumed  to  be  a  Lipschitrian) 
with  a  smooth  boundary  ”,  and  Hi(Q)  and  u  (Q)  are  the  Hilbert  spaces  of 
order  1  and  2,  respectively,  wi?h  compact°support, 

a  | 

Hq(Q)  »  |u:  Du  Cl^Q),  ljjl  *  m,  D  u  ■  0  on  r, 

l|l  <  m  -  1}  (15) 

2 

Thus,  if  w  is  in  H  (Q)  then  w  and  its  derivatives  of  order  1  and  2  are 
square  integrable  in  Q  and  w  and  its  first  derivatives  vanish  on  T*  The 
bilinear  form  B(*,*)  and  linear  form  !(•)  are  defined  by 
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The  existence  and  uniqueness  of  the_solution  to  Eq.  (13)  follows  from 
the  Lax-Milgram  theorem  (See  [16])  if  B(A,A)  is  continuous  and  H-elliptic, 
l.e.,  there  exist  constants  and  such  that 


Ib(.*\,a)  |  <  po  iaihiaih 


B(A,A)  >  nx  IAI* 


and  f(A)  is  continuous  on  H.  Here  !•!_  denotes  the  product  norm  in  H, 


l(w,4>  ,4»  )*2  •  Iwl2  +  l(|*  I2,  14*  I2- 


Hfc(Q) 


H  (Q) 


H  (Q) 


The  continuity  of  B( • ,  •)  and  !(•)  can  be  easily  established  using  the 


Cauchy-Schwarz  inequality  (the  constant  pp  depends  on  the  maximum  values 

of  Dj j,  Fj^  j  and  Hjj)  -  However,  the  proof  of  the  H-ellipticity 

of  B(  • , *)  Is  considerably  involved  and  will  be  addressed  in  a  separate 


publication. 


I"  EXACT  SOLUTIONS  AND  NUMERICAL  RESULTS.  Exact  solutions  to  the 
bound,.  ’alue  problem  described  by  the  first  three  equations  of  Eq.  (B). 
with  tii  following  simply  supported  boundary  conditions  of  a  rectangular 
plate  can  be  obtained  (see  [14]) : 


w(x,0)  *  w(x,b)  «  w(0,y)  *  w(a,y)  *  0 


4»x(x,0)  -  4>x(x,b)  -  4»y(0,y)  -  4»y(a,y) 


(essential) 


^(O.y)  »  M,(a,y)  -  H2(x,0)  »  M2(x,b)  -  0 


P1(o»y)  *  P2(a»y'  “  p2^x‘°^  *  p2^x>b^  *  0 


(natural) 


where  the  origin  of  the  coordinate  system  is  taken  at  the  lower  left 
corner  of  the  plate.  The  (0°/90o/0°)  croso-ply  laminate  with  the 
following  lamina  properties  and  under  sinusoidal  loading  is  considered 

(g13  *  ®12  and  vi3  *  vi4>  : 


Ei/E2  *  25,  G12/E2  -  0.5,  G23/E2  "  0,2»  V12  ”  0,25 


Figure  1  contains  plots  of  the  nondimensional  center  deflection, 

=  -  «(f .  o,  Vj  x  104 

f  a 
o 

versus  side  to  thickness  ratio  of  square  laminates.  The  deflections 
predicted  by  the  higher-order  theory  are  in  good  agreement  with  the  3-D 
elasticity  theory  (see  [17]) .  The  effect  of  not  including  the  transverse 
shear  strains  on  the  deflections  is  significant  for  side  to  thickness 
ratios  smaller  than  20,  as  can  be  seen  from  the  difference  in  the 
solutions  predicted  by  the  classical  laminate  theory  (CLT)  and  the  first 
and  higher-order  shear  deformation  theories  (FSDT)  and  HSDT) .  The  present 
higher  order-theory  gives  more  accurate  deflections  than  the  first  order 
theory. 

As  explained  in  the  introduction,  the  higher-order  theory  represents 
the  parabolic  distribution  of  the  transverse  shear  stresses  whereas  the 
first-order  theory  represents  only  constant  distribution  through 
thickness.  The  distributions  of  o  (“O.)  through  the  thickness  (computed 
from  the  constitutive  equations)  as* given  by  the  two  shear  deformation 
plate  theories  and  the  3-D  elasticity  theory  are  compared  in  Fig.  2.  Both 
plate  theories  suffer  from  the  discontinuities  at  the  lamina  interfaces 
because  the  continuity  of  stresses  across  interlamina  is  not  impooed. 
Relatively,  the  higher-order  theory  gives  more  accurate  solution  than  the 
first  order  theory.  The  average  value  of  a  at  the  lamina  interfaces 
given  by  the  higher-order  theory  is  closer  to  the  elasticity  solution  than 
the  first-order  theory  solution. 

Lastly,  Fig.  3  contains  plots  of  cr  ( mo£  versus  side  to  thickness 
ratio.  The  higher-order  theory  again  snows  an  Improvement  over  the  first 
order  theory. 

V.  SUMMARY  AND  CONCLUSIONS .  A  third-order  shear  deformation  theory 
based  on  assumed  displacement  field  is  presented  for  laminated  plates. 

The  theory  accounts  for  parabolic  distribution  of  the  transverse  shear 
stresee*  and  hence  does  not  require  the  use  of  shear  correction 
coefficients.  In  general,  the  theory  is  more  accurate  than  the  first- 
order  theory  while  both  theories  contain  the  same  displacement  functions. 

Existence  and  uniqueness  of  solutions  to  the  variational  problem  of 
the  theory  are  not . available  at  this  writing.  The  proof  of  the 
elllptlclty  in  a  proper  vector  space  is  considerably  involved  and  requires 
attention. 
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Figure  1  The  effect  of  transverse  shear  deformation  on  the 
deflections  of  a  square  laminate  (0°/90#/0°)  under 
sinusoidal  transverse  load 
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ABSTRACT.  A  discussion  of  what  is  called  the 
anisoparametr ic  finite  element  method,  suited  for  penalty 
constraint  formulations,  is  presented  in  the  context  of 
displacement  flexure  elements.  Two  auxiliary  criteria  for  the 
kinematic  field  are  elaborated,  on  which  basis  the  desired 
interpolation  assumptions  are  established.  Much  attention  is 
devoted  to  the  lowest  order  trianqular  plate  element.  Several 
numerical  experiments  are  conducted  to  ascertain  the  element 
performance. 

1.  INTRODUCTION.  Numerous  problems  in  solid  mechanics  are 
treated  by  finite  element  models  based  upon  (exterior)  penalty 
constraint  variational  methods  (see  e.g.  111).  Refined  flexure 
theories  that  extend  classical  assumptions  to  account  for 
transverse  shear  deformation  and  rotary  inertia  effects  are 
representative  of  these  methods  (e.g.,  (2-51).  In  these 
theories,  the  transverse  shear  deformation  energy  serves  as  a 
penalty  constraint.  Its  purpose  is  to  fulfill  the  classical 
assumption  of  negligible  transverse  shear  as  well  as  to 
accommodate  the  non-classical  (shear-weak)  deformation  regime. 

There  has  been  an  extensive  effort,  and  for  the  most  part 
unsuccessful,  to  adopt  the  well-established  isoparametric 
methodology  to  penalty  constraint  formulations.  The  essential 
feature  of  the  isoparametric  approach  is  that  interpolation 
assumptions  are  independent,  and  the  same  parametric  shape 
functions  describe  spatial  variations  of  field  variables  and  the 
element  geometry  (  super-  and  sub-parametric  schemes  employ 
respectively  higher  and  lower  order  interpolations  for  the 
element  geometry,  see  (5)). 

One  major  drawback  resulting  from  the  isoparametric  approach 
is  manifested  by  overly  stiff  behavior  in  the  classical  regime. 
This  phenomenon  is  termed  'shear  locking'.  Improvements  have 
been  achieved  by  the  use  of  selective/reduced  integration 
procedures,  which  effectively  alleviate  locking  f7-im .  However, 
such  remedies  are  often  accompanied  by  spnriuos  mechanisms  that 
render  these  elements  unreliable  for  qeneral  applications. 

In  this  communication,  a  consistent  finite  element 
methodology,  termed  anisoparametric,  is  discussed  in  connection 
with  linear  displacement  models  for  penalty  constraint  flexure 
elements.  The  distinguishing  feature  of  this  methodology  is 


manifested  by  the  kinematic  interpolations  that  obey,  in  addition 
to  the  standard  convergence  criteria  (e.g.,  see  f5)),  tie 
criteria  impelled  by  the  penalty  constraints.  The  kinematic 
variables,  in  order  to  meet  these  auxiliary  requirements,  employ 
distinct  deqree  parametric  polynomials  (hence,  the  term 
anisoparametr ic) .  These  interpolations  make  possible  a 
convenient  explicit  degree-of-f reedom  reduction  procedure 
performed  a  priori  to  the  element  matrix  integrations. 
Consequently,  simple,  isoparametric-like  nodal  patterns  and 
compatible  kinematic  fields  can  be  obtained.  Several  such 
elements  nave  recently  been  proposed  for  the  analysis  of 
Timoshenko  beams  (313#  axisymmetric  shells  r  3  21  and  plates 
m-153  .  These  elements  do  not  suffer  from  ‘’locking'  and 
overall  are  superior  in  performance  to  their  reduced  inteoration, 
isoparametric  counterparts. 

The  latest  plate  elements  0  4,153  employ  so-called 
element-appropriate  shear  corrections,  which  accelerate  the 
solution  convergence.  This  aspect  is  addressed  here  briefly. 
Much  of  the  discussion  is  focused  on  the  lowest  order  triangular 
element,  whose  complete  derivation  may  be  found  in  051.  Several 
numerical  examples  are  carried  out  with  this  element  to 
demonstrate  its  performance  characteristics. 


2.  'ISOPARAMETRIC 


associated 
examine  a 


SHEAR  LOCK I MG' 
with 


_  To  ascertain  the  cause  of 

a  typical  penalty  constraint 
element  'displacement'  approach 


difficulties 

formulation,  we  examine  a  plate 
according  to  the  refined  theories  of  Reissner  03  or  Mindlin  03. 
Using  either  theory,  the  strain  energy  functional  for  a  linearly 
elastic,  isotropic  plate  element  of  thickness  h,  area  A,  Young's 
modulus  E  and  Poisson's  ratio  u  takes  the  form  (refer  to  Pig.l 
for  the  plate  notation)  : 

0  -  %W  in  SJ<X  *  2,9yiXeXjy  +  r/  y  +  1/2<1-U)(0y  y  +  S^n  dxdy 


+  a//l/A[  Yxz  +  yyz3  dxdy} 


(2.1) 


where  the  first  and  second  integrals  represent  nondim ?nsional 
bending  and  .transverse  shear  strain  energies,  respectively; 

-  —  -  * *  »  -  •  •  ’  •  •  •  ♦  '  *»  *  -•  *•  »  »  /U'  1  —  U  ~ 


U=Eh“/3 2(1-  u2)  is  the  bending  rigidity;  o  = 
penalty  parameter;  k  is  the  shear  correction 


«k(l-u)  A/h' 
factor;  and 


is  the 


xz 


W. 


+  9 


and  y  = 
yz 


w,  +  0 

y  * 


(2.2) 


are  the  transverse  shear  (penalty)  strains;  and  a  ccmma  is  used 
to  denote  partial  differentiation. 

The  second  integral  in  (2.1)  may  be  interpreted  as  a 
penalty  constraint  functional.  Its  purpose  is  twofold:  (1)  to 
enforce  the  classical  limit  of  vanishing  transverse  shear  as  the 
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plate  approaches  its  th 
constraints  Yxz,Yyz  — >f 


s  its  thin  limit  (i.e.,  as  n — >n  tne  KirchnoL*- 
,Yyz  — >P  are  enforced);  (?.)  to  accommodate  tne 
(thick)  regime  accounting  for  the  transverse  shear 


non-classical 
et  feet. 


It.  is  emphasized  that  the  liniting  classical  regime  is  the 
critical  one  for  the  element  behavior.  As  t.ie  penalty  parameter 
tends  to  infinity  as  n — >d,  it  is  desirable  to  ensure  that  the 
vanishing  snear  strains  do-  not  originate  any  spurious 
constraining.  This  is  because  the  spurious  constraining,  often 
referred  to  as  'locking',  constitutes  a  major  stumbling  block 
leading  to  excessively  stiff  and  essentially  useless  solutions. 


Practically  all  isoparametric  elements  (i.e.,  elements 
employing  tne  same  parametric  functions  for  the  deflection  ,w, 
and  normal  rotation  ,  0X  an(*  Gy  ,  variables,  see  e.g.  f^l)  suffer 
from  the  locking  effect  under  'normal'  integration  of  the 
stiffness  matrix  (  a  'normal*,  integration  implies  the  minimum 
order  numerical  (usually  Gauss)  quadrature  rule  that  produces 
exact  integrals  taken  over  a  rectangular  (or  triangular)  plan) . 
At  first  glance,  it  appears  paradoxical  that  such  an  approach 
leads  to  worthless  results  even  though  all  of  the  fundamental 
convergence  requirements  are  fulfilled  (i.e.  rigid  body  modes, 
constant  strain,  compatibility  and  differentiability  criteria, 
r*l )  .  However,  upon  examination  of  the  penalty  strains  while 
under  Kircnhoff  constraints,  the  cause  of  the  spurious 
constraining  becomes  apparent. 


To  illustrate  this  point,  consider  an  isoparametric  ,  linear 
triangular  element  located  in  the  x-y  Cartesian  framework  whose 
origin  is  at  the  element  centroid.  The  kinematic  interpolations 
mav  be  written  as 


w  =  c00  +  C10X  +  C01y 


(2.3) 


°i  =  doo>  +  dio)x  +  doi)y  ’  (i=X’y) 


where  c,  and  d.,  1  are  generalized  constants  dependent  upon  w. 
and  G.  .  (  l<j£3  )  degrees-of-f reedom  (dof) ,  respectively.  Th3 
Kirchho¥%  constraints 


0  -e  Y  =  w,  +0 
xz  x  y 


_  .  ,,(yX  .  .(y)v  , 

■  (cl0  +  d00  5  +  d10  x  +  d01  y 


(2.4) 
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'yz  *y  j: 

.  (c  +  <j^x^  +  d^x  +  dvX^v 
{C01  00  '  a10  x  a01  y 


P.F) 


are  enforced  in  the  limit  as  h — >n.  If  the  sheer  strain  energy 
is  integrated  exactly,  which  is  in  accordance  with  variational 
requirements,  then  all  six  constraints  resulting  from  (2.4,8)  are 
approximately  enforced.  Among  these,  the  four  constraints. 


j(y)  j(y>  *(*)  j  GO  n 
d10  >  d01  *  dio  •  d01  *  0 


(2.*) 


exclusively  dependent  oh  the— got-a±Aonal  dof.  are  the  spurious 
(nonphysical)  ones.  Upon  their  enforcement^  normal  rotations 
(2.3b)  can  assume  either  zero  or  constant  values,  and  this  causes 
the  bending  energy  to  vanish  (see  (2.1)).  The  practical 
implication  is  that  excessively  stiff  results  are  obtained  ,i.e, 
the  solution  is  'locked'  due  to  spurious  she'ar  constraining. 

To  alleviate  the  spurious  locking  phenomenon  within  the 
framework  of  the  isoparametric  assumptions,  a  one-point 
centroidal  quadrature  on  the  shear  strain  energy  might  be  used, 
which  effectively  'drops’  the  spurious  constraints  (i.e.,  (2.4, 
5)  are  evaluated  at  x=y=P.)  <?uch  reduced  integration  schemes, 
though  often  plagued  by  one  or  more  spurious  zero  energy  modes, 
have  been  used  widely  due  to  their  simplicity  accompanied  with 
significant  improvements  in  results  (e.g. ,  refer  to  T7,81). 

The  locking  phenomenon  can  also  be  linked  to  the  inability 
of  the  isoparametric,  exactly  integrated  element-:  to  represent 
the  state  of  constant  shear,  even  though  a  constant  term  is 
present  in  each  shear  strain.  To  observe  this,  consider  a  simple 
one-dimensional  example  —  a  cantilevered  beam  loaded  by  a  shear 
force  at  the  tip.  Assuming  the  beam  axis  coincides  with  the 
x-riirection  and  setting  y=^  (i.e.,  considering  a  beam  element), 
it  becomes  apparent  tnat  the  theoretically  .constant  shear  state 
(Y  =  const.)  can  only  be  achieved  if  d.,  y,=0.  However,  in  the 
linear  isoparametric  element. 


1 


d  (y) 
n]a  ' 


0.-9. 

yi  y2 


(2.7) 


» 

I 


and  for  d._  y  t r  vanish,  the  two  rotational  dof  must  be  equal  or 
be  identically  zero.  Put  in  this  case  ,  again,  the  bending 
enerqy  vanishes  and  locking  is  encountered.  On  the  other  hand,  a 
non-locking  solution  can  be  obtained  in  the  thick  regime  P)  ,  in 
which  case  £  0y?,  yielding  a  linear  (not  constant!)  shear. 

It  then  follows  that  the  isoparametric  approach  should  be 
regarded  inadequate  for  the  treatment  of  the  penalty  functional 
of  (2.1). 
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3.  AUXILTAPY  CRITERIA. 


non-locking  solutions,  it 
arineres  to  tne  auxiliary, 
requirements : 


In  order  to  produce  effective 
is  proposed  that  the  kinematic  field 


penalty-constraint  impelled 


Criterion  1  .  Constant  penalty  sttair.s  must  he  accommodated  when 
such  deformation  states  are  imposed  upon  a  finite  size  element. 


Criterion  7.  +he  classical  limiting  condition  of  vanishing 
shear  strains  must  be  attainable  without  engendering  any  spurious 
constraining . 


These  criteria  point  toward  tne  net: on  of  ’proper’  shear 
strain  polynomials  r)3i.  In  tnese  polynomials,  each  generalized 
coordinate  is  represented  by  a  linear  combination  of  the 
deflection  and  normal  rotation  dof,  i.e.,  they  possess  no 
spurious  terms  dependent  on  a  sinole  independent  variable.  It 
can  be  readily  ascertained  that  the  proper,  complete  polynomials 
are  perfectly  suited  for  trianqular  elements  (also 
one-dimensional  elements  such  as  beams  and  axisymmetric  sneils  , 
see  fl.l,l?1  ).  Such  shear  strain  polynomials  of  degree  P  may  be 
expressed  as: 

def 

0.,)  * 
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P  P 
E  L 
o-O  8=0 


(A  \  G  8 
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a(j>  *  0  V  (a  +  3><  p  , 
a8 

a(3)  =  *°V,  0ik),  (j=*,y  r  i-y,*  t*H  p=i,2,...) 

c-.8  aB 

wnere  w,  and  0  denote  the  appropriate  nodal  dof. 

iv  IK 

Tne  form  of  (3.1)  suggests  tnat  tne  polynomial  for  w  must  be 


one  degree  higher  than  that  for 


0. 


Pence,  the  desired 


triangular  element  interpolations  can  be  written  as 
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represented  by  the  same  complete  polynomials  as  the  normal 

rotations 


in  (3.1).  Consequently,  the  deflection  slopes  w,  .  are 


0i’s. 
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Tt  must  he  remarked  that  fulfillment  of  Criterion  1  will 
generally  guarantee  satisfaction  of  Criterion  2  as  well,  except 
when  certain  *  overrestrained '  boundary  conditions  are  prescribed. 
The  interested  reader  is  referred  to  U51  for  a  detailed 
discussion  on  this  subject. 

ftNISOPARAMETFIC  INTERPOLATIONS.  Interpolations  (3.2) 
once  expressed  in  terms  of  the  element  parametric  coordinates 
will  he  referred  to  as  anisoparametric.  The  term  is  meant  to 
emphasize  t.ne  distinct  order  of  the  kinematic  interpolations. 
The  well-established  isoparametric  approach  would  then  constitute 
a  sub-class  of  tne  anisoparametric  strategy. 

In  the  derivation  of  a  triangular  element,  it  is  convenient 
to  utilize  parametric  coordinates  expressed  in  terms  of 
subtriangle  areas  (refer  to  ?ig.2)  ,  i.e., 

3 

(  ?2»  ^  ^  *  1  /A  (  A^i  9  A^) »  with  Z  A^  **  A  (4*1) 

The  unique  linear  relation  between  Cartesian  and  parametric 
coordinates 

3  3  3 

(1*  x»  y)  *  (  £  Cj  £  5..X**  2  >  M.2) 

i-1  1*isal  i=l  1 


permits  an  inverse  relation  as  well.  The  simple 
formula 

integration 

rr  *.a  WC  j  .  a*  b«  c.  n , 

ff  ?2  C3  dxdy  (a+w-c+2):  2A 

(4.3) 

allows  for  the  explicit  exact  quadratures  of 
matrices. 

all 

element 

The  initial  kinematic  assumptions  for  the 
interpolation  order  element  (p  =  1)  are  taker  as  (refer 
for  the  nodal  pattern) : 

lowest 
to  Fig. 3 

quadratic  deflection 

w  »  N  w 

- v 

(*.«) 

where  tne  row  vector  of  quadratic  trial  functions  and  the 
associated  vector  of  nodal  dof  are: 

N  *  “  {Wj}  ,  (i  »  1,...,6) 
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ck  .  (i  -  1.2,3;  k  -  2,3,1) 


('.'a) 


0.  =  ?  0.  ,  i  -  x,y 

i  - 1 
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wnere  the  row  vector  of  linear  trial  functiv  *i  and  the  associated 
vector  of  nodal  dof  are: 


1  4  -  {04U},  k  =  1,2,3 


ik 


('’.Fa) 


Equations  (A. A,  5)  can  be  directly  used  in  formulating 
element  matrices.  However,  it  may  be  advantageous,  from  the 
standpoint  of  nodal  simplicity,  to  condense  out  the  -aid-edge 
deflection  dof.  This  can  be  accomplished  in  the  tollowing 
manner:  enforce  continuous  shear  constraints  at  every  element 
edge  as  given  by  the  differential  relation 


T«.,  -  <»•*  +  VJ  V0  5  °-  (1  -  1-2-3) 


M.G) 


where  s  denotes  the  edge  coordinate  and.  0  is  the  tangential 
edge  rotation  (refer  to  Fig.  3) .  The  enforcement  of  constraints 
(4.^)  at  the  three  element  edges  yields: 


wi+3  -  1/2  (w.  +  vj}  +  1/8  [bk(0xi  -  0xj)  +  ak(0yj  -  0yi>] 
(i  -  1,2,3;  j  -  2,3,1;  k  -  3,1,2) 


(4.7) 


Upon  substitutino  (4.7)  into  w  in.  (4.4),  there  results  a 
constrained  deflection  field  exclusively  in  terms  of  vertix  dof 
(refer  to  £ig.  3  ),  i.e., 


w=cw  +  L0  +  H  0  , 

-a.  —  - x  —  ~y 

L  «•  {L±>,  M  =  w  *  {wi>,  0*  «  {©xi>,  ©y  -  {0yi>, 

=  1/b  (bkN±+3  -  bj\+3),  Mi  =  1/8  (aj\+3  ~  *1^^)  » 

-  1.2,3;  j  -  2,3,1;  k  -  3,1,2* 

where  the  conditions 
3  3  3 

Z  Lt  =  0,  Z  M4  =  0,  Z  ^  =  1 


(4.P) 


(4.9) 
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guarantee  satisfaction  of  the  fundamental  constant  strain 
criterion. 

The  deflection  given  by  (<* .?)  possesses  continuity  of  order 
r  (C'*)  witnin  the  element  and  across  its  edges.  Togetner  with 
the  linear  rotation  assumptions  this  constrained 

def lectio  field  of  tne  second  decree  is  readily  employed  in  tne 
formulativ.  i  of  the  element  matrices.  Further  formulation 

details  of  this  element,  designated  as  VTN3 ,  m?y  be  found  in 
. 

5.  ELFMENT-APPRCPP^ATE  SHEAR  CORRECTION.  The  major  impetus 
contributing  to  tne  performance  enhancement  of  the 
anisopar^metr  ic  elements  f  1  r  1  ^  1  is  the  notion  of  the 
element-appropr iate  shear  correction  factors.  This  idea  is  based 
on  the  assertion  that,  in  the  context  of  finite  element 
(in-plane)  approximations,  additional  and  often  unwanted 
kinematic  constraining  commonly  occurs.  It  may,  therefore,  be 
desirable  to  introduce  an  element  (in-plane)  correction  ,  <J>  ,  to 
complement  the  classical  (throuah-the-thickness)  ^-factor.  Such 
a  compounded  factor  is  element  dependent  and  can  be  expressed  in 
a  separable  form,  i.e.,  for  the  n-th  element  in  the 
discretization,  we  have 


ke  «  <j>e  k  (5.1) 

n  rn 

whereas  the  classical  correction  usually  pertains  to  the  whole 
plate.  In  r  1 51  ,  by  employing  an  energy  matching  procedure,  <j>e 
emerged  in  the  form  n 


n  +  (Cb  -  D^f1 


(5.2) 


where  C.  is  an  invarient  element  constant  established  by  a 
numerical  experiment;  ty2  represents  a  characteristic 
material-geometric  parameter11  which  can  be  linked  to  tne  element 
stiffness  penalty  parameter.  Two  distinct  approaches  for 
determining  p2  are  proposed  in  (14,151,  The  interested  reader 
may  consult  these  references  for  further  details. 

5 .  NUMERICAL  EXAMPLES.  Several  numerical  experiments 
dealino  with  isotropic,  homogeneous  plates  are  presented.  The 
examples  are  chosen  to  demonstrate  MIN3's  performance  in  the 
critical  patch  tests  and  standard  convergence  problems. 

Kirchhoff  Patch  Test  The  popular  tnin  plate  patch  test  f]cl  is 
undertaken.  A  thin,  rectangular  plate  is  subjected  to  the  state 
of  bending  such  that  all  three  moment  resultants  are  constant 
throughout  the  plate  domain.  The  plate  is  discretized  v.’ith  four 
geometrically  distinct  MIN3  elements  as  depicted  in  Fig.d.  The 
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model  apparently  recovers  exactly  the  three  constant  moments  in 
each  of  the  elements. 

Sneer  Paten  Test  Since  the  present  refined  theory  involves 

transverse  shear  stress  resultancs,  it  may  be  desirable  to 
conduct  an  appropriate  shear  patch  test.  To  do  this,  we  propose 
to  examine  an  infinitely  Iona  clamped  plate  subjected  to  an 
uniform  transverse  line  load  ,  0  ,  along  the  lonq  free  edge  (see 
Fig. 5a).  Here,  tne  state-  of  constant  o  -shear  exists  (i.e., 
0  =r  )  ,  while  O  is  zero  everywhere  inene  plate.  The  narrow 
s£?ipzfour  elemer$?  model  subject  to  symmetric  boundary  conditions 
(Fig. 5b)  produced  exact  constant  values  of  0  and  O  in  every 
element.  '  ^ 

Thin  Square  Plates  Tnin  square  plates  are  commonly  used  to 

assess  convergence  characteristics  of  plate  bending  elements. 
Figure  7  depicts  convergence  curves  for  problems  solved  with 
different  mesh  patterns,  loadings  and  boundary  conditions  (refer 
to  Fig.  5) ,  where  MIN3  solutions  are  compared  with  those  of  DKT 
(non-conforming,  thin  triangular  element,  r]7l).  The  present 
element  performance  is  seen  to  be  very  competitive. 

n ,  CONCLUDING  REMARKS.  The  anisoparametric  finite  element 
method,  requiring  two  auxiliary  kinematic  field  criteria,  has 
demonstrated  to  be  effective  for  the  treatment  of  the 
penalty-constraint  flexure  formulations  fll-1^1 .  The  method 
relies  on  an  exact  (normal)  integration  and  ,  hence,  assures 
consistent  and  kinematically  reliable  (correct  rank)  models.  It 
takes  advantage  of  an  explicit  dof  reduction  technique,  achieved 
via  ’continuous’  shear  edge  constraints,  which  generates 
compatible  kinematic  fields  and  simple  isoparametric-like  nodal 
configurations.  There  exists  no  ambiguity  reqa.rdinq  derivation 
of  consistent  load  vectors  and,  in  dynamics,  consistent  mass 
matrices.  The  anisoparametric  elements  suffer  no  deficiencies  in 
either  the  classical  (thin)  or  shear-weak  (thick)  flexure  regimes 
and  at  the  same  time  produce  rapidly  convergent  solutions. 
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DATA:  E  “  1000.0,  o  -  0.3,  h  -  1.0 

BOUNDARY  RESTRAINTS:  v  -  0  at  node*  1,  2  and  4. 

ANALYTIC  SOLUTION:  M__  -  M  -  -  1.000  throu*hout  the  plate. 

/* 


Fig.  4  Thin  plate  patch  test 
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IMPORTANCE  OF  CRACK  TIP  SHAPE  IN  ELASTIC-PLASTIC  FRACTURE  ANALYSIS 
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ABSTRACT*  Elastic-plastic  solutions  for  two  crack  problems  are 

-  "s 

discussed.  The  problems  involve  . 'blunt  cracks.  In  one  case,  the  crack 
tip  geometry  is  semi-circular  with  parallel  flanks;  in  the  other  case, 
a  circular  tip  geometry  is  considered.  We  refer  tc  them  as  the 
"U-tip*  and  "keyhole"  crack  problems.  The  cracks  have  finite  length 
and  are  within  an  infinite  domain  under  plane  strain  constraint.  Far 
from  the  crack  a  uniform  tensile  stress  state  is  specified.  The 
solution  history  to  a  remote  stress  level  equal  to  2R%  of  the  yield 
stress  was  obtained  for  each  problem.  Slipline  analysis  suggests  that 
the  near  tip  stress  distribution  for  these  cases  and  others  such  as 
the  narrow  ellipse  and  perfectly  sharp. crack  are  significantly 
different.  At  least  for  the  acuity  and  load  level  considered,  the 
elastic-plastic  analyses  show  that  the  U-tip  and  keyhole  crack 
solutions  are  virtually  identical.  The  slipline  maximum  stress  value 
which  is  commonly  used  in  fracture  analysis  is  not  achieved  in  our 
solutions.  This  is  seen  to  follow  from  the  fact  that  the  top  of  the 
crack  tip  surface  does  n,t  reach  fully  plastic  conditions.  Although 
the  elastic-plastic  boundary  reaches  the  top,  plastic  straining  occurs 
there  only  temporarily.  Local  unloading  develops  with  the  result  that 
only  the  surface  in  the  angular  range  ±75°  is  included  in  the  zone  of 
plastic  flow  at  the  later  stages  of  loading. 

INTRODUCTION:  Criteria  for  ductile  crack  extension  are  often  based 
upon  the  stress  and  deformation  states  suggested  by  continuum  crack 
tip  analysis.  The  shape  of  the  crack  tip  can  be  an  important  aspect 
of  such  analysis,  as  can  be  readily  seen  from  slipline  solutions. 
However,  in  many  cases,  the  sharp  crack  model  is  appropriate. 

Solutions  then  are  singular  at  the  crack  tip  and  fracture  criterU* 
utilize  singularity  amplitudes  such  as  K  or  J.  When  the  opening  at 
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the  crack  tip  is  significant  on  the  microscale  of  the  fracture 
investigation,  the  sharp  crack  singular  solutions  do  not  provide  the 
necessary  local  detail,  so  that  then  solutions  which  account  for  the 
crack  tip  opening  and  shape  are  necessary. 

There  has  been  very  little  work  done  on  the  elastic-plastic  blunt 
crack  problem.  Plasticity  effects  are  generally  discussed  in  terms  of 
the  local  slipline  predictions,  even  though  the  limits  of 
applicability  of  these  predictions  are  often  unknown.  There  has  been 
noteworthy  work  published  in  the  current  literature  on  the  solution 
ahead  of  cracks  which  have  blunted  as  a  result  of  large  plastic 
deformations  incurred  during  loading.  In  the  p.  ^-nt  work,  ws  are 
concerned  with  the  flaw  which  is  blunted  in  the  stress-free  state  and 
which  is  not  so  sharp  that  finite  deformation  blunts  it  further  in 
contained  plasticity  situations.. 

With  numerical  approaches  such  as  the  finite  element  method,  the 
single  most  difficult  feature  of  the  blunt  flaw  problem  is 
accommodating  the  relative  smallness  of  the  root  radius  in  the 
discretization.  In  the  analysis  of  engineering  notches  or  even  sharp 
cracks,  the  controlling  dimensions  are  usually  of  the  same  order  of 
magnitude.  In  contrast,  for  the  cases  that  we  discuss  below,  the  root 
radius  is  smaller  than  the  crack  lenqth  by  a  factor  of  2000.  Since 
details  of  the  stress  variations  in  the  region  influenced  by  the  root 
geometry  is  desired,  grid  dimensions  must  be  fractions  of  the  root 
radius. 

We  have  developed  i  numerical  formulation,  Ref.(l),  which  can 
readily  accommodate  the-  elastic-plastic  blunt  crack  problem, 
especially  those  cases  which  have  plastic  zone  dimenui .,ns  sr.  'll 
compared  to  crack  length.  The  formulation  naturally  treats  infinite 
domains  so  that  solutions  free  of  extraneous  boundary  effects  can  be 
obtained.  In  a  region  surrounding  the  crack  tip  finite  element 
methodology  is  used.  The  region  size  is  chosen  according  to  the 
plastic  zone  extent  that  is  anticipated.  Over  the  remainder  of  the 
domain,  the  planar  elasticity  complex  stress  function  method  is 
employed.  Boundary  collocation  techniques  are  used  to  couple  the 
equations  governing  in  the  two  regions.  At  each  step  of  the  analysis, 
discrete  unknownsare  nodal  displacement  increments  in  the  finite 


element  region  and  coefficients  of  a  power  series  approximation  to  the 
governing  stress  function  in  the  remainder  of  the  domain.  The  unique 
value  of  the  formulation  lies  in  its  capability  to  accurately  treat  an 
infinite  elastic  region  with  a  single  low  order  power  series. 

The  elastic-plastic  solution  for  a  cracklike  elliptical  flaw  under 
cylic  loading  wst  presented  in  Ref.  (1).  The  non-hardening 
Prandtl-Reuss  elastic-plastic  constitutive  law  was  used  in  the 
formulation.  The  flaw  is  cracklike  in  the  sense  that  its  length  is 
much  greater  than  the  radius  of  curvature  of  its  ends,p  .  The  small 
finite  element  region  used  in  the  analysis  restricted  the  remote 
stress  level  to  12%  of  the  yield  stress  V  and  the  maximum  plastic  zone 
extent  to  7,5 p  .  Near  the  crack  tip  the  results  were  in  excellent 
agreement  with  the  appropriate  slipline  solution.  Departures  from  the 
solution  and  the  approach  to  the  purely  elastic  distribution  were 
discussed.  As  an  example,  at  peak  load  with  the  elastic-plastic 
boundary  at  a  distance  3.3 p  ahead  of  the  flaw,  the  stress  solution 
agreed  with  the  slipline  distribution  over  roughly  one-half  this 
plastic  zone  extent.  While  the  agreement  in  this  inner  region  was 
within  1%,  at  the  elastic-plastic  boundary  the  slipline  stress  result 
of  2.36Y  was  13%  too  high.  The  results  graphically  demonstrate  that 
slipline  predictions  are  valid  only  over  a  portion  of  the  plastic 
zone,  essentially  that  portion  far  enough  away  from  the 
elastic-plastic  boundary  so  that  fully  plastic  flow  is  kinematically 
possible. 

Slipline  analysis  suggests  that  the  stress  gradient  near  a 
traction  free  circular  boundary  is  more  severe  than  that  near  an 
elliptical  boundary.  If  a  crack  tip  is  formed  by  a  semi-circle  with 
parallel  flanks  at  +  90°  (U-tip) ,  the  normal  stress  rises  from  the 
surface  value  of  1.15Y  to  a  stress  plateau  near  3Y,  at  a  distance  3.8 p 
ahead  of  the  crack.  At  this  location  the  stress  value  is  2.4Y  for  the 
cracklike  ellipse.  If  the  crack  tip  is  a  complete  circle  (keyhole 
crack) ,  the  stress  can  increase  to  a  maximum  of  4.78Y,  the  exact 
result  depending  upon  the  portion  of  the  tip  surface  and  surrounding 
material  which  yields  and  reaches  fully  plastic  conditions.  The 
slipline  stress  distributions  for  the  cracklike  ellipse,  U-tip  and 
keyhole  cracks  are  compared  in  Fig.(l).  In  the  following,  we  present 
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results  from  the  elastic-plastic  analysis  of  the  latter  two  problems, 
delineating  those  aspects  which  define  the  limits  of  applicability  of 
the  slipline  predictions. 

FORMULATION;  The  numerical  method  used  in  this  work  has  been 
discussed  in  Ref.  (1).  It  combines  features  of  the  elastic  complex 
variable  stress  function  approach  discussed  by  Bowie  and  Freese,  Ret. 
(2) ,  and  the  elastic-perfectly  plastic  incremental  finite  element 
approach  discussed  by  the  authors.  Ref.  (3).  A  very  important  aspect 
of  the  method  for  our  cases  of  long,  narrow  flaws  involves  conformal 
mapping  and  analytic  continuation.  This  allows  the  traction  free 
crack  boundary  condition  to  be  implicitly  satisfied  so  that 
discretization  and  collocation  are  necessary  only  at  the  ends  of  the 
flaw.  This  very  powerful  aspect  can  be  used  if  a  mapping  function 

z»  to({)  is  known  which  maps  the  unit  circle  onto  the  flaw.  The 

\ 

mapping  function  for  the  elliptical  flaw  (semi-axis  a,b)  is  of  course 
known  and  this  function  applies  to  the  limiting  case  of  a  slit  or 
sharp  crack  with  b*0.  We  use  the  slit  in  the  present  work  to  represent 
the  crack  in  the  elastic  region.  The  desired  blunt  shape  is  taken  to 
be  connected  to  the  slit  in  the  finite  element  region.  Whereas  the 
general  planar  elasticity  problem  requires  determination  of  two 
analytic  stress  functions,  continuation  reduces  the  problrn  to  finding 
a  single  function  $(s)  which  satisfies  the  renu  •»  stress  condition  and 
th~  equilibrium  and  compatibility  coupling  conditions  along  the  finite 
element  interface.  While  formally  there  is  an  interface  around  each 
end,  symmetry  allows  us  to  treat  a  single  quadrant  of  the  plane  in  our 
analysis. 

The  interface  is  defined  in  the  auxiliary  p?„ane  as  a  circle  of 
radius  R  centred  at  the  end  of  the  slit,  so  that  the  problem  in  the 
elastic  region  is  to  find  4>  outside  the  disks  |r±l|<  R  .  The 
approximation  used  had  <^S  as  the  sum  of  the  purely  elastic  function  and 
a  15  term  Laurent  series  expanded  about  the  ends  of  the  slit 

15  2  n 

4>  -  T  [ac  -3a/c  3/8  +  1  a  e  /(c  -1) 

n-1 

The  radius  R  was  chosen  in  the  present  analysis  so  that  the  finite 
element  region  extended  a  distan' e  of  75 p  from  the  flaw  tip  at  70° 
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above  the  flaw  direction.  In  the  physical  plane  tne  interface  is  a 
non-circular,  smooth  contour  ending  on  the  faces  of  the  slit.  The 
piecewise  linear  interface  shown  in  Fig.  (2)  corresponds  to  the  finite 
element  edges  there. 

The  finite  element  grids  consisted  of  quadrilaterals,  each  div  .ded 
by  its  diagonals  into  four  constant  state  triangles  to  accommodate 
incompressible  deformation.  There  were  rou^ily  1000  nodes  and  2pco 
triangles  in  each  grid.  The  finite  elements  defined  the  blunt  ends  of 
the  cracks.  In  the  U-tip  problem,  the  slit  of  the  elastic  formulation 
opens  to  a  parallel  faced,  s«.mi-circular  ended  slot  of  length  20?, 
while  in  the  keyhole  problem  a  split  circular  boundary  is  connected  to 
the  slit.  In  both  cases  the  tip  radius  was  taken  to  be  a/100P.  While 
a  more  natural  U-tip  problem  would  have  a  uniform  opening  along  the 
entire  crack  length,  there  is  no  known  mapping  function  for  this  case. 
There  is  little  reason  to  expect  tnat  the  geometry  used  will  have  a 
ne&r  tip  solution  significantly  different  than  this  case. 

The  coefficients  in  the  series  are  real  numbers  and  tney  are 
determined  by  matching  force  and  displacement  conditions  at  the  nodal 
points  of  the  interface.  Actually  in  the  incremental  analysis  we  deal 
with  increments  of  force  and  displacements  corresponding  to  remote 
stress  changes  AT.  Force  increments  are  expressed  in  terms  of  an  at 
the  nodes  and  rhis  represents  the  load  transfer  across  the  interface. 
The  usual  stiffness  approach  is  followed  to  develop  the  governing 
equations  in  the  finite  element  region.  Condensation  results  *n  a 
system  of  equations  relating  the  interface  force  and  displacement 
increments.  By  expressing  the  latter  in  terms  of  <f>,  a  system  Is 
obtained  exclusively  *n  terms  of  «lT  A  series  with  fewer  terms  than 
finite  element  interface  degrees  of  freedom  is  chosen  so  that  the 
system  is  over-determined  and  the  solution  is  by  the  least  squares 
method.  With,  knowledge  of  on  ,  displacement,  strain  and  stresr 
increments  a  r  :■  computed  throuebout  the  finite  element  mesh.  The  value 
of  AT  at  each  step  in  the  incrementation  is  araptively  established  ia 
an  iterative  fashion  to  (tceurately  trace  the  details  of  the  plastic 
yielding  and  flow. 


ELASTIC-PLASTIC  RESULTS:  Hie  adaptive  incrementation  technique  determines 
load  step  size  on  the  basis  of  a  specified  deviator ic  stress  change  maximum 
vtfiich  we  chose  to  be  equal  to  0.05Y.  The  result  was  that  51  steps  were  taken 
for  the  plastic  zone  to  extend  to  the  interface.  The  50th  step  is  thus  the 
last  reliable  one  and  it  corresponded  to  a  final  remot°  stress  of  0.28Y. 
First  yield  was  at  the  triangle  borcering  the  tip  surface  at  the  x-axis.  The 
load  at  first  yield  suggests  on  elastic  stress  concentration  factor  of  68.5. 
The  results*  for  the  two  problems  differed  by  only  0.2%.  This  factor  can  be 
compared  to  the  value  64.2  for  an  elliptical  flaw  with  tine  same  a/p  ratio. 

The  grids  used  in  the  two  analyses  are  shown  in  Fig.  (3)  along  with  the 
outlines  of  the  plastic  zones  at  maximum  load.  The  gr>..';'  are  shown  as 
quadrilaterals  but  of  course  each  of  these  consists  of  four  -  ric-.igles.  Ihe 
elements  which  satisfy  the  yield  condition  are  drawn  in  the  rirst  quadrants. 
There  is  remarkably  close  agreement  between  the  results.  Although  the  keyhole 
plastic  zone  extent  appears  greater  in  the  52.5  -  67.5°  range,  actually  the 
two  triangles  which  differ  are  within  1%  of  yield  in  the  U-tip  problem  , 
corresponding  to  the  fact  that  the  load  level  is  0.71  lower  in  this  case. 
Smoothing  suggests  that  the  maximum  extent  of  the  plastic  zone  is  at  60°  and 
approximately  equal  to  64p.  The  plastic  zone  has  the  fxom  associated  with 
sharp  cracks  under  small  scale  tensile  yielding  conditions.  Larsson  and 
Carlsson,  Ref.  (1),  considered  the  problem  of  a  sharp  crack  in  a  finite  plate 
and  found  the  maximum  extent  at  65  and  equal  to  0.23  (K/Y)  .  K  is  the 
elastic  stress  intensity  factor  vhich  'would  equal  T*5ra  for  our  problem.  This 
result  suggests  an  extent  of  57 p ,  11%  less  than  our  result.  Whereas  the 
plastic  zone  extends  a  distance  8.8p  along  the  x-axis  in  our  problem,  the 
sharp  crack  result  suggests  5.5p. 

A  very  significant  aspect  of  the  solution  involves  elastic  unloading  near 
the  top  of  the  crack  tip.  In  the  c.  uly  stages  of  loading  there  is  a  steady 
advance  of  the  elastic-plastic  bou  /Jury  along  the  t'p  surface.  Yielding 
reaches  tne  top  (half  of  the  quad  between  82.5  and  90°  yields),  plastic 
straining  temporarily  occurs  near  the  top  and  then  elements  between  67.5  and 
90°  unload.  It  appears  that  the  yielding  away  from  the  surface,  beyond  the 
90°  ray,  effectively  serves  as  a  load  shedding  mechanism.  The  small  unloaded 
region  can  be  seen  in  Fig.  (4)  vhere  plastic  zones  at  T  =  .14Y  (prior  to 
unloading)  and  nwjximui..  load  are  cc  '  .a red.  The  scale  is  such  that  only  the 
Local  plastic  zone  is  included  for  the  maximum  load.  The  keyhole  results  are 
identical  to  those  shown  in  the  figure. 


The  near  tip  elastic-plastic  stress  distributions  for  the  U-tip  and 
keyhole  problems  are  displayed  in  Fig.  (5)  for  six  stages  of  loading.  The 
results  are  virtually  identical  for  the  two  problems.  Tbe  plot  shows  °yy/Y 
vs.  x/p  tfiere  x  is  measured  from  the  circular  surface.  The  solid  curve  in  the 
figure  is  the  logarithmic  spiral  slipline  result, 

tfyy/7  =  1.15  (1  +  In  (1  -»•  x/p)) 

which  is  plotted  out  to  x/p  =  3.81  as  is  appropriate  for  the  U-tip  flaw  when 
yielding  reaches  the  flanks.  If  yielding  spreads  along  the  flanks,  the  stress 
beyond  3.81/;  remains  constant  at  2.97Y.  The  log  spiral  from  90°  is  drawn  in 
the  figure.. 

The  position  of  the  elastic-plastic  boundary  an  the  x-axis  is  indicated  by 
hatch  marks  in  the  figure.  As  load  increases  from  the  incipient  yield  value 
of  f!0]7Y  to  0.1SY,  it  is  seen  that-  the  slipline  prediction  applies  over  an 
increasing  region.  Beyond  0.1 5Y  this  behavior  ceases  and  the  slipline 
distribution  applies  only  out  to  the  fixed  location  x  *  2.1Rp.  The  log  spiral 
which  intersects  the  x-axis  there  is  drawn  and  it  is  seen  to  intersect  the 
crack  tip  at  56°.  This  behavior  reflects  the  unloading  from  90°  to  the  67.5  - 
75°  range  that  has  been  discussed.  The  slipline  drawn  effectively  defines  the 
limit  to  the  log  spiral  region  for  our  case  of  small  scale  yielding.  Fully 
plastic  conditions  can  develop  beyond  this  slipline,  of  course,  but  the  log 
spiral  distribution  cannot  apply  there.  At  higher  load  levels  the  solution 
beyond  x  =  2.18p  appears  to  be  approaching  a  steady  state,  with  a  maximum 
stress  of  2.57Y  at  x  =  2.76p. 

CONCLUSIONS;  The  results  have  important  implications  in  many  areas  of 
fracture  analysis.  For  instance,  the  prediction  of  cleavage  ahead  of  a  crack 
is  often  based  upon  the  characteristic  fracture  stress  of  a  material.  The 
results  which  show  a  maximum  achievable  stress  different  than  commonly  thought 
is  very  significant  to  this  work.  Another  area  of  application  of  the  results 
is  in  the  assessment  of  fracture  ahead  of  machined  flaws  In  fracture  testing , 
Analytically,  it  remains  to  be  seen  if  the  crack  tip  unloading  phenomenon 
influences  behavior  once  finite  strains  develop  at  the  tip.  The  effect  of 
strain-hardening  on  the  observed  behavior  is  another  area  for  investigation. 
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FIGURE  4.  ELEMENTS  SATISFYING  YIELD  INDICATING  UNLOADING  NEAR  TOP  SURFACE 
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Abstract 


A  series  of  critical  experiments  in  the  determination 
of  a  plastic  flow  rule  is  re-examined  using  kinematic 
hardening.  Non-proportional  loading  experiments  on 
thin-walled,  aluminum  tubes  were  conducted  by  Budiansky, 
Dow,  Peters,  and  Shepherd  in  1951  to  determine  whether 
plastic  flow  exhibits  behavior  consistent  with  the  physical, 
slip  theory  of  plasticity  or  with  the  phenomenological ,  J2 
flow  and  deformation  theories.  Their  results  were  mixea 
since  none  of  these  theories  predict  the  full  range  of 
exhibited,  material  behavior.  Pan  and  Rice  have  sparked 
recent  interest  in  these  experiments  by  introducing  a  slight 
rate  dependence  into  slip  theory.  Through  a  judicious 
choice  of  a  strain  rate  sensitivity  parameter  they  match  the 
experiments  reasonably  well. 

This  note  reports  on  the  comparison  of  these 
experiments  against  the  predictions  of  a  flow  rule  based  on 
the  Prager/  Ziegler  kinematic  hardening  theory.  Both  shear 
and  axial  strains  are  predicted  for  a  variety  of  load 
histories.  Results  show  good  agreement  between  theory  and 
experiment.  The  implications  to  buckling  and  instability 
analysis  are  briefly  discussed. 

I .  INTRODUCTION 

In  this  note  a  brief  reassessment  of  some  experiments 
ni  on  plastic  flow  rules  will  be  made  in  light  of  the 
results  predicted  using  Prager/Ziegier  kinematic  hardening. 
First  the  experiments  will  be  described  with  the  original 
comparisons  using  the  models  of  J_  deformation  and  flo'f 
theories  (isotropic  hardening)  and  sli  '■heory.  Then  the 
recent  paper  by  Pan  and  Rice  [21  employ..  ,  rate  sensitivity 
in  the  slip  theory  will  be  shown  to  improve  the  modeling. 
Finally  some  recent  calculations  using  simple  kinematic 
hardening  will  be  presented  and  discussed. 

At  the  First  National  Congress  of  Applied  Mechanics  in 
1951  the  results  of  Some  nonproportional  loading  experiments 
on  thin  wall  tubular  test  specimens  were  presented. 
Budiansky,  Dow,  Peters,  and  Shepherd  [11  conducted  tests  at 
NACA  labs  at  Langley  Field  to  investigate  the  behavior  of 


plastic  flow  near  the  point  of  a  change  in  loading 
direction.  They  compressed  tubes  of  14S-T4  aluminum  alloy 
into  the  plastic  range  to  strains  of  about  0.5%  then 
abruptly  changed  the  loading  path  and  continued  loading  at  a 
fixed  ratio  of  axial  stress  increment  to  shear  stress 
increment,  do/dt  .  Their  intent  was  to  look  ft  shear  and 
axial  strain  response  just  after  this  loading  corner.  The 
simple  plasticity  theories  in  use  at  that  time  predict  quite 
different  strain  behavior.  The  J.  isotropic,  flow  theory 
contains  a  smooth  yield  surface  so  it  predicts  that  the 
initial  shear  strain  response  would  be  elastic  at  a  change 
in  the  loading  direction. 

The  J_  deformation  theory  and  the  then  recently 
proposed  slip  theory  predict  the  immediate  accumulation  of 
plastic  flow.  They  both  predict  a  tangent  modulus  which  is 
reduced  from  its  eltxs-  c  value  by  the  formation  of  a  corner 
on  the  yield  surfacr  By  determining  which  theory  better 
approximated  the  experiments,  the  authors  hoped  to  explain 
why  plastic  buckling  experiments  agreed  better  with 
calculations  using  a  reduced  tangent  modulus  while  the  body 
of  experimental  evidence  had  supported  the  model  of  a  smooth 
yield  surface. 

Their  results  are  not  repeated  here  in  detail  except 
to  describe  the  general  trends  and  the  authors  conclusions. 
In  each  specimen  the  initial  shear  response  was  elastic. 
For  all  ratios  ofdcr/dxthe  elastic  shear  strain  accounted  for 
all  the  measured  shear  strain  just  after  the  loading  corner. 
This  observation  is  in  accordance  with  J2  isotropic,  flow 
theory  or  any  flow  theory  having  a  smooth  yield  surface. 

For  continued  straining  the  results  did  not  favor  one 
theory  over  another.  The  experiments  showed  shear  response 
that  was  "softer"  thsn  predicted  by  isotropic  hardening  flow 
theory  but  was  "stiffer"  than  predicted  by  slip  or 
deformation  theories.  The  experimental  results  fell  between 
the  predictions. 

For  most  of  the  cases,  th  >  continued  accumulation  of 
plastic  strain  after  the  loading  corner  is  underestimated  by 
these  theories.  One  explanation  lies  with  the  treatment  of 
the  behavior  of  this  aluminum  alloy  as  rate  independent  at 
room  temperature.  The  tests  were  run  at  a  very  slow  strain 
rate  (  — *  10“  sec  ~l)  and  a  component  of  creep  strain  might 
be  expected.  This  would  lead  to  an  increase  in  the  axial 
strain  over  the  predictions  of  the  rate  independent  theory. 

II.  RATE  DEPENDENT  SLIP  THEORY 

In  a  1983  paper  by  Pan  and  Rice  [21,  recent  interest 
was  shown  in  these  experiments.  Rate  dependence  was  used  to 
improve  the  predictions  of  slip  theory.  Pan  and  Rice 
investigated  the  implications  of  introducting  a  slight  rate 


dependence  into  the  simple  slip  theory  of  Batdorf  and 
Budiansky  [31.  The  original  assumption  was  that  the  shear 
strain  on  any  slip  system  in  a  crystal  is  a  function  only  of 
the  maximum  resolved  shear  stress  on  that  system  over  t'r  e 
loading  history.  This  leads  to  a  rate  independent  theory 
for  the  macroscopic  constitutive  behavior  when  integrated 
over  all  slip  directions  and  slip  systems. 

Pan  and  Rice  assumed  that  the  microscopic  behavior  is 
slightly  rate  dependent  through  a  non-linear  viscous  re¬ 
lation: 


*  ■  ^(ITy7> 


1/m 


(1) 


whereyis  the  plastic  shearing  rate,  m  is  the  plastic  strain 
rate  sensitivity,  a  is  the  reference  plastic  shearing  rate 
and  g(Y)  is  a  function  of  the  current  state.  Note  that  g(Y) 
is  just  the  function  for  t  when  Y  =  a, 

Several  values  for  m  were  chosen  since  separate  tests 
for  strain  rate  sensitivity  had  not  been  ' conducted.  The 
value  of  m  which  gave  the  best  matching  with  the 
nonpropor ci<~nal  tests  was  0.03.  This  value  is  a  little 
higher  than  one  would  normally  expect  for  aluminum  at  room 
temperature  T21. 

Pan  and  Rice  show  results  for  3  of  the  6  experiments 
conducted  by  Budiansky  et.  al.  In  each  case  they  show  that 
the  introduction  of  rate  dependence  can  greatly  increase  the 
agreement  of  slip  theory  with  the  experiments.  Their 
results  are  repeated  here  in  Figures  1-3.  Note  that  the 
original  results  of  Budiansky  et.  al.  are  also  plotted.  An 
initial  elastic  shear  stress-strain  response  is  predicted  at 
the  goading  corner  in  accordance  with  observations.  The 
continued  deformation  is  also  predicted  quite  ./ell  although 
there  is  some  divergence  between  theory  and  experiment  at 
higher  strain.  By  judicious  choice  of  the  strain  rate 
sensitivity  parameter,  rate  dependent  slip  theory  can  be 
shown  to  give  a  good  description  of  these  nonproportional 
loading  experiments. 

For  detailed  calculations  on  buckling  and  other 
instability  phenomena  the  information  obtained  from  these 
types  of  tests  are  crucial.  The  predicted  loads  are  very 
sensitive  to  the  transverse  stiffness  after  longitudinal 
plastic  straining.  The  rate  dependent  slip  theory  is  shown 
to  provide  a  good  model  which  can  match  experiments  quite 
well  by  adjusting  the  strain  rate  sensitivity.  This  sort  of 
a  microscopically  based  model  is  useful  when  considering 
simple  geometries  and  homogenous  stress  states  but  is  far 
too  computationally  expensive  for  use  in  general  analysis 
such  as  might  be  conducted  using  a  finite  element  code.  It 
is  for  this  reason  that  this  author  has  examined  simple, 
kinematic  hardening  in  the  context  of  nonproportional 
loading. 


III.  KINEMATIC  HARDENING 


Budiansky  at.  al.  fll  remarked  that  their  data  might 
be  best  correlated  by  a  linear  flow  theory  whose  loading 
function  gives  a  higher  curvature  to  the  loading  surface  at 
the  prestress  point  that  does  isotropic  hardening.  The 
simple  kinematic  hardening  model  proposed  by  Prager  f 4 1 
satisfies  just  such  a  set  of  conditions,  the  curvature  being 
given  by  that  of  the  initial  yield  surface.  Prager  first 
introduced  the  concept  of  a  translating  yield  surface  in 
1955  so  the  model  -.’as  not  available  to  Budiansky  et.  al.  at 
the  time  they  analysed  these  experiments.  It  appears  that 
nonproportional  loading  experiments  of  this  type  have  never 
been  examined  w^th  the  kinematic  hardening  model.  After  the 
early  1950' s,  the  experimental  emphasis  in  biaxial 
plasticity  turned  away  from  studying  flow  rules  to  plotting 
yield  loci.  Kinematic  hardening  concepts  have  been 
successfully  used  to  describe  some  of  the  phenomena 
associated  with  yield  surface  movement  but  as  a  flow  rule 
the  theory  has  not  been  subject  to  the  same  experimental 
scrutiny. 

Without  going  into  a  detailed  discussion  of  the 
development  of  this  phenomenological  theory,  a  few  remarks 
are  appropriate.  The  theory  considered  here  is  that 
proposed  by  Prager  [41  and  later  modified  by  Ziegler  [51. 
Restricting  ourselves  to  small  strains  we  consider  an 
initial  yield  surface  of  the  von  Mises  type  which  retains 
its  size  and  shape  but  translates  without  rotation  during 
plastic  straining.  The  flow  rule  is  associative  and  the 
evolution  law  for  the  position,  in  stress  space,  of  the 
yield  surface  center  is  given  by 

a  -  u(S-a)  (?) 

where  S  is  the  stress  deviator  and  v  is  the  scalar  function, 
derivaFle  from  the  consistency  condition,  which  describes 
the  hardening  behavior.  This  theory  was  applied  to  the 
experiments  of  Budiansky  et.  al.  A  power  law  form  was 
applied  to  match  the  standard  uniaxial  stress  strain  curve 
in  compression  given  in  [21. 

eP  - c  <Sr-  1>n 

where  c#=  25  ksi  (3) 

n  =  3.33 

c  =  0.0317 

These  values  gave  a  very  good  match  to  the  compression 
experiment  and  provided  easy  evaluation  of  the  stiffness  at 
any  strain  level  during  the  nonproportional  test.  In  order 
to  account  for  slight  differences  in  material  properties 
between  specimens,  we  adopt  the  same  approach  as  Budiansky 
et.  al.  During  the  pure  compressive  loading  portion  of  each 


test  the  uniaxial  stress  strain  curve  was  compared  with  of 
the  standard  curve.  The  ratio,  denoted  by  a,  of  the  stress 
given  by  the  standard  curve  to  that  of  the  individual 
specimens  during  the  compressive  loading  was  used  to  modify 
the  expression  above.  They  assumed  that  the  plastic  strain 
would  be  &  function 
of  A  times  o  . 

eP  =  c  (  ~~  l)n  (4) 

where  ^  ^standard  curve 

<ax)  specimen 

This  allowed  the  same  uniaxial  stress  strain  relation 
to  be  used  for  all  the  specimens  even  though  small 
differences  in  the  flow  stress  level  were  exhibited  between 
specimens.  The  values  of  A  were  determined  from  the 
compressive  loading  portion  of  the  tests.  They  are 

tabulated  in  fll. 

The  kinematic  hardening  relations  werv  coded  using  a 
one-step,  F'^er  explicit  integration  scheme.  The  step  size 
was  varied  to  study  error  accumulation.  Increasing  the 
number  of  equal  steps  from  100  to  1000  chanqed  the  final 
plastic  strain  by  less  than  1%.  Since  error  varies 
inversely  with  step  sice  in  a  linear  fashion  for  explicit 
Euler  1000  steps  was  considered  sufficient  for  predictions 
within  experimental  accuracy.  The  same  procedure  was  also 
used  to  integrate  small  strain,  inotropic  hardening 
relations  for  comparison. 

The  six  loading  histories  tented  by  Budiansky  et*  al. 
were  considered.  They  are  shown  schematically  in  Figure  4. 
Notice  that  the  ratios  ofda/dxfor  continued  loading  varied 
from  +1.91  to  -1.13.  This  covered  the  range  from  total 
loading  to  elastic  unloading. 

Figures  5-10  show  the  results  for  the  isotropic  and 
kinematic  models  compared  to  the  experimental  data.  Shear 
stress  versus  plastic  shear  strain  and  versus  the  increase 
in  plastic  axial  strain  are  plotted.  Notice  in  each  case 
tha*-  the  kinematic  hardening  model  matches  the  shear  strain 
response  very  well.  The-  kinematic  model  accurately  predicts 
the  softer  response  for  shear  following  axial  extension. 
The  most  interesting  point  is  that  the  kinematic  model  does 
such  a  good  job  cf  predicting  when  plastic  flow  will 
recommence  for  the  two  cases  when  the  loading  trajectories 
go  back  through  the  elastic  zone  of  the  kinematic  and 
isotropic  models  (  dc/dt-=  -0.65f,  -1.13).  This  is  clearly 
seen  from  Figure  4  where  the  shear  stress  levels  for  the 
intersection  of  the  loading  path  with  the  yield  surface  is 
much  different  for  the  two  models.  The  kinematic  model 
yields  results  much  closer  to  experimental  observation. 
This  is  an  example  of  ».ow  non-proportional  loading  tests  are 
valuable  and  necessary  in  constructing  a  flow  rule. 
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da 

An  interesting  behavior  is  predicted  for  the  case  dt  * 

-1.13.  Figure  10  shows  that  the  axial  strain  changes 

direction  for  the  kinematic  hardening  model.  This  is  a 

result  of  the  yield  surface  translating  far  enough  to  the 

right  that  the  loading  point  has  moved  around  to  the  left 

half  of  the  yield  surface.  Unfortunately,  the  experiments 

were  not  run  far  enough  to  show  whether  this  behavior  would  j 

occur.  The  kinematic  model  does  a  good  job  of  predicting 

these  axial  strains.  The  kinematic  hardening  model  does 

better  than  the  other  theories  applied  to  this  problem.  It 

predicts  more  axial  straining  than  the  other  theories  and 

provides  a  good  overall  match  with  experiments. 

IV.  CONCLUSIONS 

The  calculations  presented  here  demonstrate  that 
although  most  plasticity  theories  yield  identical  results 
when  applied  to  proportional  load  histories,  the  change  in 
loading  direction  can  greatly  affect  the  predicted  material 
response.  In  particular,  simple  kinematic  hardening  was 
shown  to  provide  much  better  agreement  with  experiment  than 
isotropic  hardening.  In  light  of  the  overwhelming  use  of 
isotropic  hardening  in  even  this  small  strain  regime  the 
analyst  must  use  care  in  applying  a  particular  hardening 
model.  The  results  presented  here  indicate  that  kinematic 
hardening  should  be  a  more  suitable  model  for  buckling  or 
bifurcation  studies.  In  fact,  Tvergaard  showed'  that 

large  strain,  kinematic  hardening  provided  good  results  for 
biaxial  necking. 
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Figure  1.  Results  of  Fan  and  Rice. 12]  showing  comparison 
of  the  various  theories  with  the  experiments  of  Budiansky 
et.  al.  fl]  for  do/dr  *  1.91  .  Note  that  they  use  the 
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Figure  2.  Results  of  Pan  and  Rice  12]  shoving  comparisons 
among  the  various  theories  for  do/dt  «  -1.13  . 
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Figure  3.  Results  of  Pan  and  Rice  12]  showing  cor.sarisons 
among  the  various  theories  for  do/dt  »  1.18  . 
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Figure  $ .  Comparison  of  isotropic  end  kinematic  hardening 
flow  theories  with  experiments  of  Budlansky  et,  al,  11]. 
Filled  triangles  Indicate  measurements  of  plastic  flow  along 
do/dt  *  1-91  after  initial  axial  compression. 
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Figure  6  .  Comparison  of  isotropic  and  kinematic  hardening 
flow  theories  with  experiments  of  Budiansky  et.- al. >  [ 1] . ■ 
Filled  triangles  indicate  measurements  of  plastic  flow  along 
do/dr  ■  1*18  after  initial  axial  compression..' 
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figure  5  •  Comparison  of  isotropic  ana  kinematic  hardening 
flow  theories  with  experiment?  of  Budiansky  et.  al.  [1]. 
Pilled  triangles  indicate  measurements  of  plastic  flow  along 
do/dt  ■-0.655  after  initial  axial  compression. 
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Figure  10  •  Comparison  of  isotropic  and  kinematic  hardening 
flow  theories  with  experiments  of  Budiansky  et.  al.  [1]. 
Filled  triangles  indicate  measurements  of  plastic  flow  along 
do/dx  »  -1.13  after  initial  axial  compression. 


W3  *  i 


rzrm  ~  r  i  n  z  wj-%  ~  z  * — -W  .  T  —  JT^J£r^~ ~WJ* ’'-*’-5^ 


k  r.x  i**2au£xsraim  v  ^  szM^rf 


'  j 

I 

*  * 


& 

'"; 

ts\ 

1 

| 

I 


u 


r~m 

u 


'S-i 

’■i 


m 


THE  INVERSE  GAUSSIAN  PULSE  IN  THE  EXPERIMENTAL  DETERMINATION  OF 
LINEAR  SYSTEM  GREEN'S  FUNCTIONS 
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and 

Nelson  N.  Hsu 

Center  for  Manufacturing  Engineering 
National  Bureau  of  Standards 
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Abstract 


We  present  a  new  time  domain  deconvolution  method  for  determining  the 
"impulse  response"  of  linear  time  invariant  systems.  The  method  is  based  on 
the  use  of  the  one-sided,  causal,  inverse  Gaussian  pulse  as  an  approximation 
to  the  Dirac  6-function.  Deconvolution  of  that  kernel  is  equivalent  to  an 
inverse  heat  conduction  problem.  The  method  is  particularly  useful  in  cases 
where  the  Green's  function  for  the  linear  system  has  singularities  such  as 
jumps,  cusps,  spikes,  and  the  like.  Computational  reconstructions  of 
singularities,  from  smooth  synthetic  data,  are  presented  In  the  context  of 
Acoustic  Emission  Green's  functions. 


1.  Introduction 


The  problem  of  determining  the  "Impulse  response"  of  a  linear  time  invariant 
system  occurs  in  many  areas  of  measurement  science.  If  g(t)  Is  a  causal  time 
signal  representing  the  system's  response  to  a  Dirac  6-function  input  at 
t  =  0,  then  g(t)  is  the  causal  Green's  function  or  impulse  response  of  the 
system.  If  g(t)  is  known,  the  system  may  be  considered  a  "black  box";  the 
outpuc  y(t)  for  any  given  input  x(t),  is  the  convolution  of  x  with  g. 


(1.1)  y(t)  =  /  x(t-T )  g(-r)dT  =  /  g(t-T)  x(x)dr. 
0  0 


The  response  to  a  Heaviside  input  H(t)  is  sometimes  preferred.  This  response, 

i 

also  called  the  Green's  function,  will  be  denoted  by  G(t).  Clearly, 
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(1.2)  6(t)  -  /  g(s)  ds, 

0 

and  from  (1.1), 

(1.3)  y(t)  =  /  G(t-T)  x(t )dr . 

0 


In  many  important  cases  g(t)  or  G(t)  are  not  smooth  functions  of  t.  Rather, 
they  exhibit  singularities  such  as  jumps,  spikes,  cusps,  and  other  strikingly 
sharp  features.  In  fiber  optics  measurements,  an  optical  time  domain 
reflectometer  system,  (OTDR),  is  often  used  for  fiber  parameter  estimation; 
see  e.g.  [1,  p.  236],  [2,  p.  391],  [3],  and  the  references  therein.  This 
non-destructive  technique  uses  short  pulses  of  light  which  are  launched  into 
one  end  of  the  fiber,  and  the  reflected  signal  is  observed,  as  a  function  of 
time,  at  the  same  input  end.  The  instrument  is  based  on  the  fact  that 
fluctuations  in  the  refractive  index  along  the  fiber,  as  well  as  defects  and 
imperfections,  cause  light  to  be  backscattered.  Isolated  imperfections 
produce  sharp  spikes  superimposed  on  an  otherwise  smooth  monotone  profile.  The 
impulse  response  of  the  fiber  is  particularly  useful  in  characterizing  the 
fiber,  as  well  as  in  locating  flaws,  [4,  p.  61],  [5,  p.  1],  Another  example 
in  nondestructive  testing  is  the  field  of  acoustic, emission,  (AE).  Ultrasonic 
waves  emitted  by  stressed  regions  in  elastic  materials  are  detected  by  means 
of  transducers,  and  these  time  signals  are  used  to  locate  structural  flaws. 

See  [6],  £7],  [8],  [9].  In  such  studies,  the  dynamic  Green's  function  for  the 
structure  plays  an  important  role  as  a  general  representative  solution  to  the 
problem  of  elastic  wave  generation  and  propagation  caused  by  a  localized 
source.  In  the  case  of  an  infinite  plate,  a  comprehensive  treatment  of  the 
Green's  tensor  was  recently  given  in  [10].  In  typical  configurations,  see 

390 


[10],  the  Green's  function  is  neither  positive  nor  differentiable;  the 
singularities  in  the  impulse  response  carry  valuable  information  regarding  the 
times  of  arrival  of  various  wave  components  of  the  signal.  ? 

Ideally,  by  using  probe  waveforms  in  the  form  of  the  5-function  or  the 
Heaviside  function,  the  impulse  response  can  be  .obtained  experimentally. 
However-  the  "band-limited1'  nature  of  signals  which  can  be  produced  in  the 
laboratory  preclude  such  infinitely  sharp  probe  waveforms.  Rather,  C” 

;  i 

approximations  to  6(t)  cr  H(t)  are  synthesized  and  used.  The  result  is  to 
smooth  out  or  blur  some  of  the  important  singularites  ip  the  system's  Green's 
function.  In  order  to  recover  the  sharp  features,  an  ill-posed  deconvolution 
problem  n.jst  be  carefully  solved.  This  requires  constraints  on  the  unknown 
Green's  function  in  order  to  stabilize  the  numerical  procedure.  Two  widely 
used  constraints  in  the  time  domain,  consist  in  prescribing  an  a-priori*  bound 
on  the  second  derivative  of  g(t),  or  requiring  g(t)  to  be  non-negative.  While 
such  regui ari zati on  techniques  often  succeed  in  reconstructing  smooth 
or  non-negative  functions,  they  are  not  applicable  to  cases  such  as  the  plate 
Green's  function.  In  this  paper,  we  outline  an  alternative  time  domain  * 
deconvolution  technique,  based  on  the  "inverse  Gaussian"  pulse.  This  pulse  Is 
intimately  related  to  a  heat  conduction  problem,  and  this  connection  »*an  be 
exploited  to  impose  weaker  constraints  on  the  unknown  solutions,  such  as  a 
bound  in  the  l2  -norm.  In  addition,  the  deconvolution  can  be  implemented  as  a 
Cauchy  problem  for  a  simple  second  order  partial  differential  equation.  This 
approach  allows  for  continuous  deconvolution,  a  useful  option  which  permits 
monitoring  the  development  of  suspected  artifacts.  A  more  detailed  discussion 
of  this  technique  is  given  in  [11]. 


2.  Thi»  Inverse  Gaussian  Pulse 


Let  o  >  0  be  a  fixed  parameter,  and  consider  the  function  of  time,  k(o,t)» 
defined  by 

(2.1)  k(a,t)  3  H(t)  exp  (”5t). 


This  C"  function  of  t  on  (-«,«•)  is  called  an  "Inverse  Gaussian"  distribution 
in  statistics  and  is  related  to  Brownian  motion.  See  [12],  [13],  [14,  p.  221] 
and  their  references.  For  small  o  >  0,  k(c,t)  approximates  6(t) ;  for  larger 
o,  k(o,t)  is  a  signal  of  short  duration.  Convolution  with  k(o,t)  smooths  out 
sharp  features  at  an  increasing  rate  the  larger  a  is  chosen.  The  indefinite 
integral  of  k(o,t)  is  a  causal  C"  approximation  to  H(t),  and  can  be  expressed 
in  terms  of  the  complementary  error  function.  We  have. 


(2.2)  e(o,t)  «  J*  k(o,s)ds  *  erfc  )• 

0 

Both  k(o,t)  and  e(o,t)  are  depicted  in  Figure  1. 


Let  g(t)  be  the  impulse  response  of  the  time  Invariant  system  under  study,  and 
let  G(t)  be  as  in  (1.2).  The  response  of  the  system  to  the  probe  waveform 
,e(o,t)  in  (2.2)  is  given  by, 

(2.3)  h(t)  *  /t  e(.o,t-x)  g(x)dx  »  /t  k(o,t-x)  j(x)dx, 

0  0 

while  that  due  to  the  waveform  k(o,t)  is  given  by, 

(2.4)  w(t)  =  J*  k(o,t-x)  g(x)dx. 

0 

Thus,  either  probe  leads  to  a  time  domain  deconvolution  problem  for 
determining  ths  Green’s  function,  in  which  the  kernel  is  the  inverse  Gaussian 
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AMPLITUDE 


pulse  (2.1).  Some  of  the  properties  of  the  convolution  with  k(o,t)  are  listed 
below. 


w(t)  in  (2*4)  is  a  C“  function  of  t  on  (0,«)  even  if  g(t)  is  in 

L“(0,»).  Moreover,  using  Schwarz's  inequality  in  (2.4),  derivatives  of 

* 

w(t)  can  be  bounded  in  the  supremum  norm,  in  terms  of  the  L  norm  of 
g(t),  if  a  >  0. 


As  a  map  from  l2{0,»)  into  L2(0,«)  the  convolution  in  (2.4)  is  a 


holomorphic  semi-group  in  the  variable  a  for  a  >'0.  See  [15,  p.  1453. 


The  Laplace  transform  of  k(o,t)  is  exp(-o/s).  The  inverse  kernel  to 
k(o,t)  has  the  Laplace  transform  exp(o/s),  a  >  0.  This  inverse  kernel 


cannot  exist  as  a  Schwartz  distribution,  since  the  Laplace  transform  of 
a  Schwartz  distribution  has  polynomial  growth  at  infinity.  See  [16,  p. 
189],  Thus,  "direct"  deconvolution  in  (2.4)  is  not  feasible.  This  is 
always  the  case  with  C"  probes.  See  [17,  p.  137,  p.  145]. 


IV.  Let  z  and  a  be  positive  constants  such  that  o  =  (£/a).  Let  u(x,t)  be 
the  unique  bounded  solution  of  the  heat  conduction  problem  for  the 


semi -infinite  rod. 


ut  =  a2  uXx»  0  <  x  <  »,  t  >  0, 


(2.5)  'u(x,0)  =  0,  0  <  x  < 


u(0,t)  =  g(t),  lim  u(x,t)  =  0 


Then,  see  e.g.  [18,  p.  172],  w(t)  in  (2.4)  is  u(x,t)  evaluated  at  x  5  i 


As  a  consequence  of  IV,  deconvolution  In  (2.3)  or  (2.4)  Is  equivalent  to 

solving  the  "inverse  heat  conduction"  problem  whereby  the  boundary  temperature 

history  g(t)  (or  G(t)),  at  x  =  0,  is  reconstructed  from  knowledge  of  the 

temperature  history  at  the  location  x  =  £.  This  is  an  ill- posed  problem  in 

partial  differential  equations  which  is  discussed  in  detail  in  [19]. 

» 

3.  The  Deconvolution  Procedure 

The  use  of  probe  waveforms  of  the  specific  type  given  in  (2.1),  (2.2)  is 
advocated,  partly  because  such  probes  are  smooth  functions  of  time  whose 
"sharpness"  can  be  controlled  by  means  of  a  single  parameter,  namely  a .  (It 
is  advantageous  to  synthesize  such  probes  wi'ch  as  small  a  value  of  o  as  is 
experimentally  feasible).  However,  an  equally  important  feature  of  (2.1)  is 
the  connection  with  the  heat  conduction  problem  (2.5)  which  can  be  exploited 
to  devise  a  deconvolution  procedure  with  certain  desirable  properties. 

Given  the  probe  parameter  o,  we  interpret  the  recorded  signal  h(t)  in  (2.3) 

(or  w(t)  in  (2.4))  as  the  solution  of  (2.5)  at  x  *  £.  By  the  semi-group 
property  II  above,  we  may  use  h(t)  to  continue  the  solution  to  the  right  of  x 
=  £  by  convolution,  and  obtain  both  u(x,t)  and  ux(x,t)  along  the  line  x  *  l'  > 
£•  These  two  functions  of  time  comprise  the  necessary  initial  data  for 
integrating  the  heat  equation  "sideways";  i.e.  as  an  initial  value  problem  in 
the  x- variable,  in  the  direction  of  decreasing  x,  from  x  =  V  to  x  =  0.  See 
[19,  Section  4]. 

Continuous  dependence  in  the  L2  norm,  can  be  restored  to  the  sideways  heat 
equation  by  imposing  an  a-  priori  bound,  M,  on  the  L2  norm  of  the  unknown 
boundary  data  at  x  =  0,  I.e.  on  the  desired  system  Green's  function  G(t).  In 
addition,  an  a-  priori  estimate,  e,  is  assumed  known  for  the  L2  norm  of  the 
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noise  in  the  recorded  data  h(t).  Let  «  be  the  L2  "noise  to  signal"  ratio, 
(3.1)  »-(*). 

The  regularization  is  accomplished  by  a  preliminary  smoothing  of  the  initial 
data  in  the  frequency  domain,  using  FFT  algorithms.  The  smooth  initial  values 
are  then  used  in  a  discrete  numerical  step  by  step  marching  procedure  in  the 
x-variable.  The  filtering  function  uses  w  in  (3.1)  as  the  only  a-  priori 
information,  and  the  regulari zati or.  is  exactly  equivalent  to  minimizing  the 
appropriate  Tikhonov  functional.  See  [19]  for  full  details. 

4.  Continuous  Deconvolution 

We  may  picture  the  marching  procedure  described  above  as  a  systematic 
continuous  unsmoothing  of  the  recorded  data  as  we  approach  the  boundary  x  a  0 
in  the  associated  fictitious  heat  conduction  problem  (2.5).  By  outputting  the 
intermediate  results  at  positive  values  of  x  as  x  tends  to  zero,  one  can 
monitor  this  unsmoothing  process.  Any  such  intermediate  result  is  a  partial 
deconvolution,  while  total  deconvolution  corresponds  to  the  solution  at  the 
boundary  x  =  0. 

Consider  now  the  solution  of  the  heat  flow  problem  (2.5)  at  some  small 
positive  value  of  x,  *  ■  x0  say.  Even  if  the  boundary  data  g(t)  lies  in  L?, 
u(x0,  t)  is  a  C"  function  of  t  which  faithfully  approximates  g(t)  if  x0  is 
sufficiently  small.  Moreover,  constraints  on  u(x0,  t),  together  with  all  its 
derivatives,  have  been  implicitly  placed  by  virtue  of  the  a-  priori  bound  M  on 
the  L2  norm  of  g(t).  See  property  I  above.  Thus,  at  x  *  x0,  the  regularized 
marching  procedure  is  approximating  a  well -const rained  smooth  function  of  t 
which  in  turn  approximates  the  non-smooth  boundary  data.  This  is  the 
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interpretation  which  should  be  placed  on  the  notion  of  partial  deconvolution 
at  x0.  In  this  sense,  the  heat  flow  problem  associated  with  the  inverse 
Gaussian  pulse  allows  for  a  deconvolution  procedure  in  which  relatively  weak 
L2  constraints  can  be  imposed.  Error  bounds,  of  logarithmic  convexity  type, 

for  the  reconstruction  at  any  x  are  given  in  [19,  Theorem  1].  These  estimates 

2  /  x/t\  2 

show  that  the  L  norm  of  the  error  at  x  is  Ole  y,  if  e  is  the  L  error  in 

the  recorded  data  at  x  =  A. 

5.  Numerical  Deconvolution  Experiments  with  Synthetic  Data 
We  present  two  examples  of  Green’s  function  reconstruction  for  the  case  of  an 
infinite  elastic  plate.  These  example.',  from  the  field  of  acoustic  emission, 
have  singularities  which  are  similar  to  those  encountered  in  optical  fiber 
back scattering  measurements  in  the  presence  of  imperfections.  See  [3,  Figures 
3-2,  3-88],  [10,  Figure  15]  and  [9,  Figures  6B,  11],  Hence,  the  applicability 
of  the  technique  in  two  different  physics'  contexts  can  be  demonstrated 
simultaneously. 

Recent  work,  [20],  has  made  it  possible  to  develop  computational  software 
capable  of  producing  the  exact  Green's  function  for  an  infinite  elastic  plate, 
given  the  test  configuration.  Use  was  made  of  this  software  to  obtain  several 
typical  examples  of  plate  Green's  functions.  By  careful  numerical  convolution 
with  the  probe  waveform  (2.1),  using  adaptive  quadrature  routines,  synthetic 
smoothed  out  data  were  created,  simulating  the  recorded  signals.  These  data 
were  then  used  as  input  initial  values  into  the  deconyolution  algorithm  of 
Section  3.  Sy  comparing  the  deconvolution  results  with  the  known  exact 
Green's  functions,  the  performance  of  the  procedure  can  be  assessed. 


The  numerical  experiments  deal  with  the  G  component  of  the  Green's  tensor, 

w  O 

describing  the  normal  displacement  due  to  a  normal  applied  force.  One  example 
studied  is  the  epi central  response  to  a  6-function  input.  The  other  example 
is  the  response  to  a  Heaviside  input  source,  located  two  plate  thickness  away 
and  on  the  same  side  of  the  plate  as  the  receiver.  Our  figures  below  display 
normalized  displacement  versus  normalized  time,*where 

(5.1)  Normalized  Displacement  *  ir*  Shear  Modulus  *  Plate  Thickness* 

Di splacement/Force 

(5.2)  Normalized  Time  »  Time*  Shear  Wave  Speed/Plate  Thickness. 


Our  experiments  take  place  on  the  time  interval  [0,5]  in  normalized  units, 
using  synthetic  data  at  550  equi spaced  points  in  that  interval.  The  probe  in 
(2.1)  was  used  with  a  parameter  value  of  a  =  .5,  time  being  measured  in 
normalized  units.  The  related  heat  flow  problem  (2.5)  was  given  the  constant 
diffusivity  a2  «  .25,  while  t  was  fixed  at  the  value  .25,  so  that  a  =  (z/a). 
The  marching  scheme  in  the  x-varlable  was  implemented  using  1000  equi spaced 
mesh  points  on  the  x-interval  [0,*].  Because  of  discretization  errors  in  the 
numerical  computation  of  the  exact  Green's  functions  as  well  as  in  the 
subsequent  convolutions,  the  synthetic  Input  data  are  not  free  from  noise. 
Accordingly,  a  value  of  w  *  lO"**  was  selected  in  (3.1),  reflecting  an  estimate 
of  the  probable  L2  norm  of  the  error  in  the  recorded  data.  When  the  noise 
level  is  of  the  order  of  0.1%  cr  lower,  the  deconvolution  procedure  is  not 
found  to  be  overly  sensitive  to  the  value  of  w,  beyond  the  general  order  of 
magnitude  of  that  quantity. 


In  Figure  2,  the  epi central  response  is  considered.  The  exact  Green's 
function  has  a  series  of  sharp  peaks,  while  the  simulated  convolved  signal 
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bears  little  resemblance  to  the  exact  solution.  Total  deconvolution  was 
attempted  with  the  calculation  pursued  up  to  the  boundary  x  -  0.  While 
"ringing"  phenomena  are  visible  in  the  reconstruction,  the  ability  of  the 
scheme  to  restore  and  localize  these  sharp  peaks  is  significant.  In  Figure  3, 
with  source  and  receiver  on  the  same  side  of  the  plate,  the  exact  Green's 
function  has  a  sharp  trough  and  a  "corner"  near  t  =  2.2.  These  sharp  features 
are  not  visible  in  the  smooth  recorded  data.  Total  deconvolution  restores 
many  of  these  features  although  an  artifact  in  the  form  of  a  Gibbs  phenomenon 
obscures  the  sharp  corner  near  t  =  2.2.  This  last  example  was  also  studied 
from  the  standpoint  of  continuous  deconvolution.  Random  noise  of  magnitude 
G.1%  was  added  to  each  of  the  550  synthetic  data  values  prior  to  deconvolu¬ 
tion,  and  the  partial  deconvolutions  at  80%  and  90%  of  the  way  back  from 
x  *  £,  were  obtained,  in  addition  to  the  total  deconvolution  at  x  =  0. 

The  results  are  depicted  in  Figure  4.  The  genesis  of  the  Gibbs  phenomenon 
near  t  »  2.2  is  apparent;  substantial  tightening  of  the  bend  to  the  right  of 
the  trough  occurs  prior  to  the  onset  of  ringing,  as  the  algorithm  attempts  to 
reconstruct  the  corner  in  the  exact  solution.  Used  in  thu  way,  continuous 
deconvolution  is  a  desirable  option  which  can  reveal  important  features  in  the 
solution  before  these  features  become  obscured  by  artifacts. 

i 

6.  Concluding  Remarks 

The  usefulness  of  the  above  procedure  rests  on  the  feasibility  of  an 
appropriate  device  for  producing  the  proposed  waveforms.  At  the  present  time, 
input  electrical  voltages  with  a  prescribed  waveform  can  be  synthesized  using 
a  digital  to  analog  converter.  To  produce  a  mechanical  pulse  with  a  pre¬ 
scribed  time  dependence  requires  a  high  fidelity  transducer  to  convert  the 


synthesized  voltage  into  an  impact  force.  Work  in  this  direction  is  cur¬ 
rently  under  way  at  the  National  Bureau  of  Standards.  See  [21],  [22].  On 
the  other  hand,  the  probes  (2.1),  (2.2)  are  similar  in  shape  to  waveforms 
commonly  found  in  the  experimental  literature.  By  attempting  to  fit  s 
probes  with  the  desired  expressions  (2.1)  or  (2.2),  for  suitable  valuer  o, 
one  can  consider  using  the  above  procedure  in  a  variety  of  linear  system 
contexts.  Future  work  should  explore  the  feasibility  of  such  "inexact" 
deconvolution. 
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ABSTRACT.  The  computer  code  DELTA  uses  a  linearized  Alternating 
Direction  Implicit  (ADI)  scheme  to  provide  a  numerical  approximation  of  the 
solution  of  the  averaged  two-phase  (gas-solid)  two-dimensional  (axisyimnetric) 
equations  governing  viscous  interior  ballistic  flows  within  conventional  guns. 
To  further  the  understanding  of  phenomena  affecting  gun  tube  life  as  well  as 
gun  performance,  a  heat  transfer  model,  which  simulates  the  interactions  of 
fluid  dynamics  and  thermal  profile  in  the  gun  tube  wall,  has  been  incorporated 
in  the  DELTA  code.  The  same  linearized  ADI  method  is  utilized  to. obtain  the 
numerical  solution  of  the  two-dimensional  nonlinear  heat  conduction  equation 
in  the  gun  tube.  Our  model  of  the  neat  transfer  process  couples  completely 
and  simultaneously  all  three  controlling  events  without  any  approximations  to 
the  governing  equations;  that  is,  the  axlsymmetric  viscous  flow  within  the 
entire  gun  tube  which  naturally  gives  a  precise  definition  of  the  gas  thermal 
boundary  layer,  the  time-dependent  balance  of  the  heat  fluxes  at  the  inner 
wall  surface,  and  the  two-dimensional  temperature  calculation  within  the  gun 
tube  wall.  The  nonlinear  heat  flux  boundary  conditions  at  the  inner  and  outer 
tube  surfaces  are  linearized  as  to  be  compatible  with  the  solution  scheme. 

Results  computed  with  DELTA  compare  the  flow  patterns  for  two  different 
types  of  Idealized  one* phase  interior  ballistics  environments.  The  first  Is  a 
pure  expansion  flow,  and  the  other  includes  mass  and  heat  sources.  Besides 
laminar  flow  calculations  the  effects  of  two  algebraic  turbulence  models  are 
considered.  Finally,  simulations  for  an  adiabatic  tube  and  for  a  tube  that 
allows  heat  transfer  are  reported. 


Current  address  is  AT&T  Bell  Laboratories,  Crawford  Corners  Read, 
Holmdel,  NJ  07733 
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I.  INTRODUCTION 


The  interior  ballistics  flows  of  conventional  charges  in  gun  tubes  show 
complex  flow  patterns.  This  is  due  to  both  the  heterogeneous  structure  of  the 
charge  itself,  and  the  rapidly  changing  flow  conditions  within  a  few 
milliseconds.  The  fast  rise  in  pressure  and  temperature  caused  by  the  burning 
propellant  initiates  turbulent,  multidimensional,  multiphase  flow  which  is 
coupled  with  the  accelerating  projectile  motion.  Up  to  now  a  mathematical 
model  that  describes  all  the  physics  occurring  in  a  complete  interior 
ballistics  cycle  is  not  available.  However,  several  models  that  simulate  some 
of  the  phenomena  exist  or  are  being  developed  12] . 

On  the  other  hand,  detailed  experimental  measurements  of  the  total 
interior  ballistic  cycle  are  not  possible.  Some  standard  techniques  as  well 
as  some  new  special  techniques  under  development  determine  only  specific 
quantities  in  real  weapons  or  in  simulators  under  simplified  flow  conditions. 
Commonly  measured  quantities  are  the  gas  pressure  and  projectile  motion. 
However,  quantities  like  the  temperature  distribution  in  the  gas  and  in  the 
gun  tube  wall,  velocity  distributions  of  the  gas  or  solid  particulates  inside 
or  outside  of  boundary  layers,  the  particle  distribution,  or  the  turbulent 
pattern  cannot  yet  be  determined  accurately  by  experiment.  Thus,  the  need  for 
modelling  the  interior  ballistic  cycle  exists  so  that  the  dynamic  development 
of  these  quantities  can  be  studied,  and  their  impact  on  ballistic  problems  can 
be  evaluated. 

One  new  computational  capability  for  investigating  interior  ballistics 
flows  is  the  DELTA  code  which  is  under  development  at  BRL.  The  purpose  of 
this  code  is  to  address  special  ballistic  problems,  related  to  boundary  layer 
development,  heat  transfer  to  the  tube  wall,  turbulence  and  time-dependent 
distribution  of  additive  particles.  In  the  following  the  basis  of  DELTA  and 
its  applications  will  be  discussed. 


II.  REVIEW  OF  THE  MODELS  IN  DELTA 

The  flow  to  be  modeled  by  DELTA  is  the  multidimensional,  two-phase  flow 
inside  of  a  gun  tube.  Presently,  the  geometry  of  the  flow  is  simplified  to 
axisymmetry.  At  the  rear  end,  the  so-called  breech,  the  tube  is  closed  by  a 
stationary  flat  plate  while  the  front  boundary  is  represented  by  the  moving 
flat  based  projectile.  The  flow  is  always  assumed  to  be  viscous  and  heat 
conducting,  but  can  be  either  laminar  or  turbulent.  The  wails  may  be 
adiabatic  or  allow  heat  transfer  from  the  gas  to  the  surrounding  metal 
surfaces.  Heat  transfer  is  restricted  to  the  tube  wall.  The  core  flow  is 
fully  coupled  to  the  moving  projectile,  to  the  boundary  layer  development,  and 
if  desired,  to  che  heat  conduction  in  the  tube  wall.  By  fully  coupled  we  mean 
that  each  of  these  phenomena  can  affect  any  other  one;  for  example,  the 
boundary  layer  development  can  alter  the  details  of  the  core  flow.  This 
situation  is  not  the  case  in  a  boundary  layer  type  model. 

The  mathematical  model  in  DELTA,  the  balance  equations  for  the  gas-phase 
and  one  solid  phase,  are  based  on  the  unsteady  volume-averaged  formulation. 
The  gas  phase  is  described  by  the  averaged  equations  corresponding  to  the  full 
Navier-Stokes  equations  for  a  compressible  fluid.  The  averaged  coefficients 
of  viscosity  and  heat  conduction,  the  averaged  .  viscous  stress  tensor,  the 


averaged  dissipation  function  and  averaged  heat  conduction  function  are 
included.  Since  interior  ballistics  flows  usually  produce  high  gas  pressure, 
the  Noble-Abel  equation  of  state  is  used  so  that  some  real  gas  effects  can  be 
included.  The  turbulence  is  considered  by  algebraic  mixing  length  models. 
The  solid  phase  is  described  by  the  averaged  equations  corresponding  to  an 
incompressible  fluid  which  can  undergo  deformation.  The  derivation  of  these 
equations  is  given  in  Ref.  [3]*  The  equations  are  listed  in  Ref.  [1]. 

The  set  of  partial  differential  equations  for  the  axisymmetric  two-phase 
flow  region  is  solved  by  a  linearized  Alternating  Direction  Implicit  (ADI) 
scheme.  This  scheme  transforms  the  differential  equations  to  a  set  of  linear 
algebraic  equations.  The  corresponding  matrix  has  a  block  tridiagonal 
structure,  and  thus,  the  solutions  at  each  new  time-step  can  be  efficiently 
determined.  Detailed  information  about  the  derivation  of  the  scheme  is 
presented  in  Ref.  ( 1 J . 

III.  HEAT  TRANSFER  TO  AND  TEMPERATURE  DISTRIBUTION  IN  THE  TUBE  WALL 

The  heating  of  the  gun  tube  wall  caused  by  convection,  heat  conduction 
and  radiation  of  the  hot  propellant  gas  along  its  inner  surface  enhances  the 
gun  tube  wear  and  erosion,  and  therefore,  affects  the  lifetime  of  gun-  tubes. 
The  experimental  determination  of  the  inner  wall  surface  temperature  is  quite 
difficult.  The  commonly  used  thermocouples  are  restricted  in  interior 
ballistic  applications  due  to  the  demands  of  an  accurate  measurement  in  a  very 
short  time  interval,  of  the  close  contact  to  the  flow,  and  of  the  special 
thermal  properties  of  the  gauge  itself. 

Currently,  there  exist  several  different  models  for  the  heat  transfer  to 
the  gun  tube  wall.  They  can  be  divided  in  four  categories  according  to  their 
complexity.  The  first  type,  at  most,  uses  a  very  simplistic  boundary  layer 
calculation  and  a  heat  transfer  correlation,  e.g.,  Colburn's  analogy,  to 
obtain  the  heat  transfer  [4—6 J •  Heat  losses  in  the  core  flow  are  considered. 
These  empirical  heat  transfer  correlations  which  are  derived  for  fully 
developed,  steady,  one-phase  pipe  flow  are  used  to  predict  the  highly  unsteady 
flow  and  heat  flux  in  a  gun.  The  main  feature  of  this  type  of  model  is  the 
emphasis  on  the  calculation  of  the  core  flow.  In  contrast  another  category 
tries  to  model  more  exactly  the  boundary  layer  by  using  more  general  boundary 
layer  equations  [7,8J.  The  heat  transfer  to  the  tube  wall  again  is  described 
by  correlations.  The  boundary  layer  edge  is  not  coupled  with  any  computation 
of  the  core  flow.  Instead,  simplified  qualitative  values  are  assumed  which 
represent  approximately  the  core  flow  conditions.  The  third  category  makes 
use  of  the  general  boundary  layer  equations  and  of  a  balance  of  heat  fluxes 
from  the  hot  gas  to  the  gun  tube  wall  at  the  inner  wall  surface  [ 9 J .  The 
conditions  at  the  boundary  layer  edge  are  comparable  to  those  in  the  second 
category.  These  approaches  do  not  include  all  the  feedback  mechanism  to  the 
core  flow,  and  therefore,  to  the  projectile  motion.  That  is,  the  phenomena 
occurring  in  the  boundary  layer  regions  are  not  fully  coupled  to  the  phenomena 
occurring  in  the  regions  away  from  the  boundary  layer,  the  core  flow  region. 
The  fourth  category  is  the  fully  coupled  approach  wherein  the  phenomena 
associated  with  the  core  flow  is  directly  linked  to  the  projectile  motion,  the 
boundary  layer  development,  the  heat  transfer  to  and  the  heat  conduction  in 
the  tube  wall,  and  vice  versa.  This  is  achieved  by  using  a  single  but  general 
set  of  equations  which  is  valid  everywhere  in  the  gas  region.  The  solution  of 


this  set  automatically  provides  the  boundary  layer  solution  in  the  boundary 
layer  region,  the  core  flow  solution  in  the  core  flow,  and  all  the  necessary 
coupling  that  naturally  occurs  in  the  flow.  Although  this  category  of 
solution  possesses  the  most  complexity,  it  provides  the  solution  with  the 
least  number  of  assumptions  and  approximations.  In  light  of  the  scarcity  of 
experimental  measurements  with  which  to  compare  the  calculations,  we  feel  the 
fourth  category  is  best.  The  DELTA  approach  is  an  example  of  the  fourth 
category. 


The  heat  transfer  model  in  DELTA  consists  of  the  equations  governing  the 
heat  conduction  in  the  tube  wall,  and  the  boundary  conditions  which  couple  the 
temperature  in  the  wall  •  to  the  flow  inside  and  outside  the  tuber.  The  heat 
conduction  in  the  tube  wall  is  described  by  the  two-dimensional,  nonlinear 
axisymmetric  equation  for  the  wall  temperature  Tw(t,r,z): 
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The  variables  t,  z  and  r  denote  the  time,  axial  coordinate  in  the  wall  and 
radial  coordinate  in  ‘  the  wall,  respectively.  The  specific  heat  cw  and  the 
thermal  conductivity  X  of  gun  tube  steel  are  non-constant  and  depend 
remarkably  on  temperature  1 10].  Although,  the  density  of  the  steel  pw 
depends  on  temperature,  its  variation  is  small. 

The  most  important  boundary  condition  is  at  the.  inner  tube  wall  surface 
where  the  coupling  of  the  flow  region  and  the  wall  occurs.  At  this  surface, 
the  balance  of  heat  fluxes  with  a  radiation  effect 
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are  enforced.  The  variables  X  ,  e,  o,  T  and  T  denote  the  thermal 
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conductivity  of  the  gas,  the  emissivity  of  the  wall  surface,  the  Stafan- 
Boltzmann  constant,  the  gas  temperature,  and  the  maximum  gas  temperature  in  a 
given  cross-section  (z  =  constant),  respectively.  We  emphasize  that  both 
conditions  are  used  only  ac  the  inner  tube  wall  surface.  The  first  term  on  the 
left  hand  side  of  the  first  condition  represents  the  heat  flux  on  the  gas  side 
towards  the  wall  by  conduction,  while  radiation  is  included  by  the  second 
term.  The  right  hand  side  gives  the  heat  flux  into  the  tube  wall.  The 
boundary  condition  at  the  outer  tube  wall  surface  is  chosen  as  a  simple 
engineering  condition 
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-  X 


w  3r 


=  h  (T  -  T  .  )  , 
w  amb 


408 


•  If,' 

■V.  ■- 


where  we  are  using  a  heat  transfer  number  h  and  an  outer  ambient  temperature 
Tajnb*  A  more  sophisticated  condition,  at  least  for  a  single  shot  weapon,  is 
not  necessary  because  the  heat  usually  does  not  reach  the  outer  surface  during 
the  ballistic  cycle  due  to  the  wall  thickness*  Two  additional  boundary 
conditions  are  needed  in  axial  direction.  At  the  projectile  base,  we  set 

T  *  T 
w  amb 

across  the  wall  thickness,  that  is,  we  assume  the  projectile  is  moving  into  a 
cold  area  with  ambient  temperature.  At  the  breech,  an  adiabatic  condition 


seems  to  be  adequate. 

These  equations  governing  the  temperature  distribution  in  the  tube  wall 
are  solved  using  the  same  linearized  ADI  method  as  are  the  equations  for  the 
gas  flow  region;  that  is,  the  equations  are  linearized  in  time,  and  are  split 
along  coordinate  directions.  At  each  new  time  level,  we  first  update  the 
temperature  distribution  in  the  wall,  and  then  update  the  governing  variables 
in  the  flow  region.  This  is  performed  by  .  the  following  sequence  of  sweeps 
along  coordinate  directions:  an  axial  sweep  followed  by  a  radial  sweep  in  the 
wall;  a  radial  nweep  followed  by  an  axial  sweep  in  the  gas  region,  and  finally 
an  adjustment  of  the  dependent  variables  along  the  inner  wall  surface  to  the 
flux  boundary  condition.  We  omit  a  discussion  of  most  details  of  the 
numerical  procedure  because  they  are  identical  to  those  explained  in  Ref. 
llj.  Because  the  thermodynamic  dependent  variables  in  the  gas  region  are  the 
specific  gas  entropy  (s)  and  logarithm  of  gas  pressure  (q),  the  heat  flux 
boundary  condition  expressed  in  terms  of  T  must  be  reformulated  an  a  linear 
equation  in  terms  of  s  and  q  at  the  new  unknown  time  level  for  the  radial 
sweep  in  the  gas  region.  To  this  end,  we  transform  the  heat  flux  term  on  the 
gas  side  via  the  chain  rule: 
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where  T„,  T  denote  the  partial  derivatives  of  T  with  respect  to  s  and  q, 
respectively.  For ■ simplicity,  we  dropped  the  index  g.  The  linearization  in 
time  gives  a  relation  between  the  unknown  new  time  level  (n)  and  the  known 
current  time  level  (c) 
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The  time-deriviatives  are  approximated  by 
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so  that  we  get  a  linear  equation  in  sn  and  qn.  This  linear  equation  is 
compatible  with  the  set  of  finite  differenced  linear  equations  derived  from 
the  flow  equations. 


IV.  TURBULENCE  MODEL 


For  studying  the  influence  of  turbulence  on  the  flow  pattern,  two 
different  turbulence  models  were  considered.  Both  are  equilibrium  algebraic 
eddy  viscosity  models  based  on  Prandtl’s  mixing  length '  hypothesis.  A 
turbulent  eddy  viscosity  yfc  as  well  as  a  turbulent  thetmal  conductivity  Xfc  are 
calculated  such  that  molecular  viscosity  y  and  thermal  conductivity  X  in  the 
equations  describing  a  laminar  flow  are  replaced  by  the  effective  values 
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The  two  models  differ  in  the  underlying  assumption  that  the  boundary  layer  is 
composed  of  one  region  or  two  regions. 


The  one-laver  model  expresses  the  turbulent  eddy  viscosity  as 
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where  p  is  the  local  density,  l  is  Pr&ndtl’s  mixing  length,  w  and  u  are  the 
velocity  in  axial  and  radial  direction,  respectively  [II].  Density  and 
velocity  gradients  are  determined  by  the  solution  of  the  partial  differential 
equations.  The  mixing  length  is  given  by  a  correlation.  For  a  steady 
incompressible  flow  in  a  tube  Nlkuradse  [llj  experimentally  determined  that 
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In  using  this  correlation  in  the  unsteady  compressible  flow  simulations,  we 
assume  that  the  correlation  still  models  the  turbulence.  We  try  to  test  this 
assumption  by  comparing  the  simulations  with  another  turbulence  model,  a  so- 
called  two-layer  model. 


The  two-layer  model  separates  the  boundary  layer  In  an  Inner  and  outer 
region  with  different  formulations  for  each  region  [12,13].  The  expression 
for  the  eddy  viscosity  of  the  inner  region  is 
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which  is  the  same  as  the  one-layer  formulation.  The  difference  is  in  the 
definition  of  the' mixing  length  l  ,  that  is, 


l  *  k  y  D, 


where  the  von  Karman  constant  k  *»  0.4  ,  y  is  the  distance  from  the  wall  and  D 
is  the  van  Driest  damping  factor.  Van  Driest  suggested  that 
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with  the  van  Driest  constant  A  *  26  .  The  subscript  w  indicates  that  these 
quantities  are  to  be  evaluated  at  the  wall  surface  (y-o).  The  wall  shear 
8 tress  x  is  expressed  by 
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In  the  outer  layer  we  use 
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v;-pre  we  denotes  the  axial  edge  velocity,  6  the  kinematic  boundary  layer 
displacement  thickness  and  6  the  boundary  layer  thickness.  In  our  case  w 
is  the  maximum  axial  velocity  in  the  cross-section  z»constant,  this  is,-  the 
velocity  on  the  axis  of  symmetry. 


Now  the  two-layer  eddy  viscosity  is  expressed  as 
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where  y  is  the  first  point  at  which  p 


In  both  cases  the  turbulent  thermal  conductivity  is  calculated  by 


where  the  turbulent  Prandtl  number  is  Prt  =  0.9  and  c  is  the  specific  heat 
at  constant  pressure. 


V.  RESULTS. 

Computational  results  are  presented  for  two  different  types  of  interior 
ballistics  flows.  The  first  simulates  a  pure  gas  expansion  flow  behind  a 
projectile  inside  of  a  constant  cross-section  tube.  The  tube  is  closed  at  one 
end  by  a  stationary  surface  called  the  breech,  and  at  the  other  end  by  a 
movable  flat  based  projectile.  The  initial  states  of  the  gas  are  uniform  and 
quiescent.  Geometrical  data,  initial  conditions  as  well  as  the  thermodynamic 
properties  of  the  gas  are  listed  in  Table  I.  He  designate  this 


TABU!  I.  Lagrange  Gun  Parameters 


Bore  Diameter 

Tube  length 

Chamber  Length 

Projectile  Hass 

Ratio  of  Specific  Heats 

Covolume 

Moxar  Mass 


20  mm 
2.0  m 
0.175  m 

120  g 

1.271 

1.08  •  10  J  m  /kg 

23.8  g/mole 


Initial  Gas  Pressure  300  MPa 
Initial  Gas  Temperature  3000  K 
Initial  Velocities  0  m/s 


idealization  as  the  Lagrange  gun.  The  flow  in  the  Lagrange  gun  is  very  well 
suited  for  staying  several  important  features  like  the  numerical  procedure, 
boundary  layer  development,  laminar  and  turbulent  axisymmetric  flow  patterns, 
and  heat  transfer  to  the  tube  wall.  Since  an  expansion  flow  is  quite  removed 
from  the  phenomena  occurring  during  a  ballistic  cycle,  a  second  type  of  flow 
which  has  the  time-dependent  pressure  and  temperature  profiles  as  in  a  real 
tube  weapon  is  simulated  by  adding  heat  and  mass  to  the  one-phase  flow  via 
source  terms.  An  empirical  burning  law  for  pressure-dependent  sources  is 
used.  The  sources  mo«e  with  the  flow.  We  designate  this  idealization  as  the 
real  gun.  The  essential  parameters  for  the  real  gun  differ  from  Table  X  only 
with  respect  to  the  initial  conditions.  The  Initial  gas  pressure  is  assumed 
to  be  ambient  pressure  (0.1  MPa)  and  the  initial  gas  temperature  to  be  ambient 
temperature  (293  K).  In  all  cases  involving  the  real  gun  simulation,  the 
projectile  is  released  from-  its  initial  position  when  the  pressure  at-  the 
projectile  base  reaches  30  MPa. 
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Because  an  implicit  finite  difference  scheme  is  used,  no  stability 
restriction  exists  on  the  size  of  the  time  step  for  a  given  mesh.  We  have 
chosen  a  constant  time-step  of  10  ys  for  computing  all  results  shown  here. 
The  computational  mesh  consists  of  49  uniformly  or  nonuniformly  spaced  mesh 
points  in  the  axial  direction  and  19  nonuniformly  spaced  mesh  points  in  the 
radial  direction.  To  obtain  a  finer  spatial  resolution  near  the  solid  wall, 
the  mesh  points  are  more  concentrated  near  the  wall  than  near  the  center 
line.  An  example  of  a  19x49  computational  mesh,  which  is  used  in  most  of  the 
computations,  is  shown  in  Figure  1.  The  smallest  grid  size  in  radial 
direction  at  the  bore  surface  is  7.7  pm  .  The  mesh  for  computing  the  heat 
conduction  in  the  tube  wall  is  generated  in  the  same  way  with  the  same  mesh 
distribution  in  axial  direction  as  on  the  gas  side,  and  a  corresponding  mesh 
concentration  near  the  inner  bore  surface.  Both  the  size  of  time-step,  and 
number  of  grid  points  seem  to  be  a  reasonable  compromise  between  accuracy  and 
computation  time. 

The  performances  of  the  Lagrange  and  real  gun  for  laminar  flows  with 
adiabatic  boundaries  are  compared  in  Figures  2-5.  The  histories  of  the  gas 
pressures,  temperatures  at  the  center  of  the  breech  and  projectile,  and  the 
velocities  and  displacements  of  the  projectile  are  shown.  The  main 
differences  are  the  temporal  distribution  of  all  the  quantities,  and  the  final 
values  of  the  muzzle  velocities.  For  the  real  gun  simulations,  the  pressure 
at  the  projectile  base  reaches  30  MPa  at  about  2.3  ms  at  which  time  the 
projectile  begins  to  accelerate  down  the  tube. 

Qualitative  results  for  a  laminar  flow  with  adiabatic  walls  in  the 
Lagrange  gun  are  presented  in  Figures  6-10.  We  caution  the  reader  of  the 
different  view  points  in  the  three-dimensional  graphs.  They  were  chosen  to 
best  display  the  results.  The  spatial  distribution  of  the  axial  velocity  is 
shown  in  Figures  6  and  7  at  3.75  ms.  At  this  time,  the  projectile  exits  the 
gun  tube  with  a  muzzle  velocity  of  623  m/s.  Figure  6  shows  the  axial  velocity 
field  when  a  uniformly  axially  spaced  mesh  in  Figure  1  is  used.  Figure  7 
shows  the  same  quantity  but  computed  on  a  nonuniformly  spaced  axial  mesh  =  The 
radial  distribution  of  the  mesh  points  are  the  same  in  both  figures.  For  a 
given  axial  position  the  axial  velocity  is  constant  across  much  of  the  radius 
of  the  tube  (the  core  flow  region),  and  decreases  to  zero  only  very  close  to 
the  wall  (the  boundary  layer  region).  The  boundary  layer  is  the  result  of  the 
no-slip  condition  (w=o)  at  the  wall.  The  thickness  of  the  boundary  layer  is 
approximately  0.2  to  0.3  mm.  In  the  axial  direction,  the  velocity  is 
distributed  linearly  in  the  core  region  between  the  zero  value  at  the  breech 
and  the  muzzle  velocity.  The  three-dime  3ional  temperature  and  pressure 
distributions  are  given  in  Figures  8  and  S  Due  to  the  adiabatic  boundary 
condition  at  the  wall  surface  the  heat  generated  by  the  viscous  forces  near 
the  tube  wall  cannot  transfer  to  the  tube  wall,  and  the  gas  temperature  rises 
towards  the  wall  surface.  Here  again  the  boundary  layer  is  only  0.2  to  0.3  mm 
thick.  The  pressure,  however,  stays  constant  in  radial  direction  over  the 
entire  cross-section.  The  assumption  in  boundary  layer  theory  that  the  radial 
pressure  gradient  Is  zero  would  be  valid  in  this  example.  Figure  10  shows  the 
3-D  graph  of  the  radial  gas  velocity.  In  approximately  the  first  70%  of  the 
distance  to  the  projectile,  the  radial  velocity  is  negative,  i.e.,  the  flow  is 
directed  towards  the  center  line  (C-L).  Thereafter,  it  is  positive  and 
increases  remarkably  towards  the  projectile  base.  Only  very  close  to  the 
projectile  does  the  value  of  the  radial  velocity  drop  rapidly  from  its  maximum 
value  to  zero.  Of  course,  the  radial  velocity  is  small  since  it  is  induced 
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only  by  molecular  viscosity  and  heat  conductivity.  The  results  in  Figures  2- 
10  agree  very  well  with  h  'th  a  one-dimensional  solution  of  the  core  flow  using 
the  method  of  characteristics,  and  the  two-dimensional  numerical  calculations 
of  Heiser  and  Hensel  [14,15]. 

The  assumption  of  a  laminar  flew  may  not  be  realistic  for  the  Lagrange 
gun  considered  above.  Because  the  Reynolds  number  based  on  the  tube's 
diameter  and  muzzle  velocity  is  of  the  order  of  107  ,  the  effects  of 
turbulence  should  be  examined.  In  comparison  to  the  laminar  flow  several 
important  differences  occur  when  the  two  algebraic  turbulence  models  described 
in  IV  are  applied.  However,  the  difference  in  the  results  between  the  two 
turbulence  models  is  insignificant.  Consequently,  we  do  not  specify  the  type 
of  model  in  the  results.  First  of  all,  we  see  the  difference  in  the 
projectile  performance  between  the  laminar  and  turbulent  type  flow.  This  is 
demonstrated  in  Figure  11  with  the  pressure  histories  at  the  breech,  in  Figure 
12  with  the  temperature  histories  at  the  breech,  and  in  Figure  13  with  the 
projectile  velocity  histories.  In  the  turbulent  flow  simulation,  the 
projectile  velocity  is  about  50  m/s  less  than  for  the  laminar  flow  at  muzzle 
time.  The  axial  velocity  flow  field  at  the  time  of  muzzle  clearance  is  shown 
in  Figure  14.  The  velocity  boundary  layer  is  fully  'developed  between  the 
center  line  and  the  tube  wall.  The  axial  velocity  overshoots  the  projectile 
velocity  near  the  center  line.  This  overshoot  is  related  to  the  radial  gas 
velocity  (Figure  15)  which  is  one  to  two  order  of  magnitude  larger  than  for 
the  laminar  flow  (Figure  10)  near  the  projectile  base.  In  this  region,  the 
radial  gas  flux  is  toward  the  tube  wall  as  before,  and  transports  mass  away 
from  center-line  which  in  turn  can  accelerate  the  axial  flow.  The  radial 
variation  of  axial  velocity  (Figure  16),  radial  velocity  (Figure  17)  and 
temperature  (Figure  18)  taken  0.25  m  upstream  of  the  muzzle  show  some  details 
about  the  differences  between  the  laminar  and  turbulent  flows  at  the  time  of 
muzzle  clearance.  Boundary  layer  calculations  in  Ref.  [7]  show  comparable 
trends  between  laminar  and  turbulent  flow.  However,  the  flow  patterns 
computed  by  different  types  of  turbulence  models,  e.g. ,  non-algebraic  models, 
may  differ.  An  experiment  corresponding  to  this  idealized  expansion  flow  is 
needed  to  validate  a  turbulence  model.  Such  experiments  are  being  attempted 
at  the  French-German  Institute  (ISL)  in  France  and  the  Ernst-Mach-Institut 
(EMI)  in  Germany. 

Besides  turbulence,  heat  transfer  is  important  in  an  interior  ballistics 
cycle  because  in  reality  energy  is  lost  from  the  gas  flow,  and  is  gained  by 
the  gun  tube  wall.  This  heating  of  the  wall’s  surface  does  influence  gun  tube 
erosion.  Our  heat  transfer  model  together  with  the  calculation  of  the  heat 
conduction  in  the  wall  couples  the  unsteady  behavior  in  both  media,  and  is 
discussed  in  Section  III.  We  now  show  results  concerning  the  laminar  Lagrange 
gun  expansion  flow  with  heat  transfer  from  the  gas  to  the  tube  wall. 
Initially  the  wall  is  supposed  to  be  at  ambient  temperature.  The  thermal 
properties  of  the  gun^  barrel  are  characterized  by  the  barrel  material 
density  p  =7.8  kg/ra  ,  thermal  conductivity  X  *  43  W/(m  K)  and  specific 
heat  Cy  "  ^60  J/(kg  K).  The  pressure  and  temperature  histories  at  the  breech, 
and  projectile  velocity  history  are  given  in  Figures  11-13,  and  can  be 
compared  with  the  other  two  simulations.  The  spatial  profile  of  the  axial 
velocity,  radial  velocity,  pressure  and  tejperature  are  plotted  in  Figures  19- 
22,  respectively,  at  the  time  of  muzzle  clearance.  When  comparing  the  figures 
to  those  of  the  laminar  flow  with  adiabatic  walls,  we  find  with  heat  transfer 
that  the  velocity  boundary  layer  as  well  as  the  temperature  boundary  layer  are 
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thinner,  that  the  radial  velocity  is  about  one  order  of  magnitude  higher,  and 
that  the  radial  velocity  is  always  directed  towards  the  wall.  For  example, 
the  velocity  boundary  layer  is  only  about  0.05  mm  thick.  Detailed  comparisons 
across  the  boundary  layers  regions  for  axial  velocities  and  temperatures  are 
presented  in  Figures  23  and  24,  and  across  the  tube  cross-section  for  the 
radial  velocities  is  shown  in  Figure  25.  All  these  profiles  are  at  0.25  m 
from  the  muzzle  at  the  time  of  muzzle  clearance.  Figure  26  shows  the  history 
of  the  wall  surface  temperatuio  at  two  different  locations  along  the  tube 
wall.  One  is  taken  50  mm  away  from  the  breech.  This  wall  point  belongs 
to  the  chamber,  and  neated  up  from  the  beginning  of  the  flow  cycle.  The 
second  wall  point  is  250  mm  away  from  the  breech,  and  is  heated  up  only  after 
the  projectile  has  passed  it  (at  0.47  ms).  The  maximum  wall  temperature 
occurs  early  in  the  4  ms  cycle.  The  laminar  gas  layer  close  to  the  tube  wall 
cools  rapidly  because  the  transport  of  heat  in  the  tube  wall  by  conduction  is 
much  faster  than  the  transport  of  heat  by  conduction  and  convection  on  the  gas 
side  towards  the  wall.  To  obtain  the  heat  transfer  precisely  from  the  gas  to 
the  wall,  we  need  to  compute  the  radial  temperature  gradient  on  both  sides  of 
the  inner  bore  surface  as  accurately  as  possible.  The  temperature  boundary 
layer,  however,  is  very  thin  as  it  is  shown  in  Figures  22  and  24  which  implies 
very  small  grid  sizes  must  be  used  close  to  the  wall.  The  smallest  one  in 
radial  direction  for  Figure  26  is  0.73  pm  . 

The  final  simulations  are  for  the  one-phase  flows  in  the  real  gun,  i.e., 
where  mass  and  energy  are  added  via  source  terms.  The  pressure  and 
temperature  histories  at  the  breech  and  projectile,  and  the  projectile 
velocity  and  displacement  histories  for  the  laminar  flow  simulation  and  for 
the  real  gun  are  given  in  Figures  2-5.  Figures  27-40  are  a  series  of  three- 
dimensional  profiles  which  characterize  both  the  laminar  and  turbulent  flows 
inside  an  adiabatic  tube.  The  group  of  figures  associated  with  tha  laminar 
flow  are  at  two  times:  the  first  at  3.6  ms  when  maximum  pressure  is  achieved 
and  the  second  at  5.3  ms  when  the  projectile  exits  the  tube.  For  these 
calculations  a  nonuniform  radial  grid  is  used  with  the  minimum  grid  size  of 
0.73  pm  because  of  the  thinness  of  the  boundary  layer.  We  excluded  the 
profile  of  the  radial  velocity  at  3.6  ms  because  the  magnitude  of  this 
component  was  smaller  than  0.05  m/s.  Three-dimensional  profiles  show  the 
turbulent  flow  ilelds  when  maximum  pressure  occurs  at  3.6  ms  (Figures  34-37), 
and  when  the  projectile  exits  the  tube  at  5.49  ms  (Figures  38-40).  As  in  the 
Lagra.ige  gun,  the  differences  are  minimal  between  the  simulations  with 
different  turbulence  model,  but  significant  between  the  laminar  and  turbulence 
simulations. 


VI.  SUMMARY 

Special  interior  ballistic  phenomena  are  investigated  by  using  the  DELTA 
computer  code  designed  to  calculate  the  multidimensional,  two-phase  flow 
behind  an  accelerating  projectile  inside  a  gun  tube.  Details  of  the  heat 
transfer  and  turbulence  submodels  used  in  the  simulations  are  given  in  this 
paper.  The  general  mathematical  model  and  numerical  algorithm  are  described 
in  a  companion  paper  Ref.  [1].  Results  are  given  for  two  types  of  idealized  , 
one-phase  interior  ballistics  gun  simulations:  the  pure  expansion  flow,  and 
the  flow  with  moving  mass  and  heat  sources.  Comparison  are  made  between 
laminar  and  turbulent  flows  as  well  as  between  flows  in  an  adiabatic  tube  and 
in  a  tube  that  allows  heat  transfer. 
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Figure  2.  Pressure  histories  at  the  center  of  the  breech  and  the 

projectile  base  for  the  laminar  flcv  with  adiabatic  walls 
in  the  Lagrange  gun  (LG)  and  real  gun  (RG)  simulations. 
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Figure  3.  Temperature  histories  at  the  center  of  the  breech  and  the 
projectile  base  for  laminar  flow  with  adiabatic  walls  for 
the  Lagrange  gun  (l G)  and  real  gun  (R6)  simulations. 
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Lagrange  gun,  laminar  flow,  adiabatic  walls: 
Spatial  distribution  of  the  axial  gas  velocity 
at  the  time  of  muzzle  clearance.  Mesh  is 
uniformly  spaced  in  axial  direction  and 
nonuniformly  spaced  in  radial  direction. 
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Figure  7.  Lagrange  gun,  laminar  flow,  adiabatic  walls: 

Spatial  distribution  of  the  axial  gas  velocity 
at  the  tine  of  muzzle  clearance*  Mesh  is 
nonuniformly  spaced  in  both  directions. 


Figure  8.  Lagrange  gun,  laminar  flow,  adiabatic  walls: 

Spatial  distribution  of  the  gas  temperature  at 
the  time  of  muzzle  clearance. 
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Figure  9.  Lagrange  .gun,  laminar  flow,  adiabatic  walls: 

Spatial  distribution  of  the  gas  pressure  at  the 
time  of  muzzle  clearance. 
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Figure  12.  Lagrange  gun:  Temperature  histories  at  the  center 
of  the  breech  for  both  laminar  and  turbulent  flows 
with  adiabatic  walls,  and  for  laminar  flow  with  heat 
transfer. 
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Figure  14.  Lagrange  gun,  turbulent  flow,  adiabatic  walla: 

Spatial  distribution  of  the  axial  gas  velocity 
at  the  time  of  muzzle  clearance. 
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Figure  15. 


Lagrange  gun,  turbulent  flow,  adiabatic  walls: 
Spatial  distribution  of  the  radial  gas  velocity 
at  the  time  of  muzzle  clearance. 
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Figure  16.  Lagrange  gun,  adiabatic  walls:  Radial  profiles 
of  the  axial  gas  velocity  for  both  laminar  and 
turbulent  flow  at  the  time  of  muzzle  clearance 
at  0.25  a  away  from  the  muzzle. 
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Figure  20.  Lagrange  gun,  laminar  flow,  heat  transfer; 

Spatial  profile  of  the  radial  gas  velocity 
at  the  time  of  muzzle  clearance. 
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Figure  23.  Lagrange  gun,  laminar  flow:  Radial  profiles 
of  the  axial  gas  velocity  for  adiabatic  and 
heat  permeable  walls  at  the  time  of  muzzle 
clearance  0.25  m  away  from  the  muzzle. 
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Figure  24.  Lagrange  gun,  laminar  flow:  Radial  profiles 
of  the  gas  temperature  for  adiabatic  and  heat 
permeable  walls  at  the  time  of  muzzle  clearance 
0.25  m  away  from  the  muzzle. 
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Figure  25.  hagrange  gun,  laminar  flow:  Radial  profiles 
of  the  radial  gas  velocity  for  adiabatic  and 
heat  permeable  walls  at  the  time  of  muzzle 
clearance  0.25  m  away  from  the  muzzle. 
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Figure  26.  History  of  the  wall  surface  temperature  for  a  laminar 

expansion  flow  (LG)  with  heat  transfer  to  the  tube  wall 
at  the  locations  150  mm  (inside  the  chamber)  and  250  mm 
(inside  the  barrel)  away  from  the  breech. 
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Figure  27.  Real  gun,  laminar  flow,  adiabatic  walls 
Spatial  distribution  of  the  axial  gas 
velocity  at  3.6  ms. 
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Figure  29.  Real  gun,  laminar  flow,  adiabatic  walls: 

Spatial  distribution  of  the  gas  pressure 
at  3.6  ms. 


Figure  34.  Real  gun,  turbulent  flow,  3diabatic  walls: 
Spatial  distribution  of  the  axial  gas 
velocity  at  3.6  ms. 
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Figure  36.  Real  gun,  turbulent  flew,  adiabatic  walls: 

Spatial  distribution  of  the  gas  temperature 
at  3*6  ms. 


Figure  38.  Real  gun,  turbulent  flow,  adiabatic  walls 
Spatial  distribution  of  the  axial  gas 
velocity  at  muzzle  clearance  (5.49  ms). 


Figure  39.  Real  gun,  turbulent  flow,  adiabatic  walls: 

Spatial  distribution  of  the  radial  gas 
velocity  at  muzzle  clearance  (5.49  ms). 


Figure  40.  Real  gun,  turbulent  flow,  adiabatic  walls: 


Spatial  distribution  of  the  gas  pressure  j 

at  nuzzle  clearance  (5.49  ms).  i 
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ABSTRACT.  An  explicit,  analytical  model  is  presented  of  finite  amplitude 
waves  in  shallow  water.  The  waves  in  question  have'  two  independent  spatial 
periods,  in  two  independent  horizontal  directions.  Both  short-cre3ted  and 
long-crested  waves  are  available  from  the  model.  '  Every  wave  pattern  is  an 
exact  solution  of  the  Kadomtsev-Petviashvili  equation,  and  is  based  on  a 
Riemann  theta  function  of  genus  2.  These  bi-periodic  waves  are  direct 
generalizations  of  the  well-known  (simply  periodic)  cnoidal  waves.  Just  as 
cnoidal  waves  arc  often  used  as  one-dimensional  models  of  "typical"  nonlinear, 
periodic  waves  in  shallow  water,  these  bi-periodic  waves  may  be  considered  to 
represent  "typical"  nonlinear,  periodic  waves  in  shallow  water  without  the 
assumption  of  one-dimensionality. 

EXTENDED  SUMMARY.  Waves  in  shallow  water  are  familiar  to  virtually 
everyone.  The  objective  of  the  work  presented  here  is  to  construct  an 
analytical  model  of  waves  in  shallow  water  that  is  simple  and  explicit  enough 
to  be  useful  for  engineering  purposes,  without  being  so  simple  that  it  fails 
to  describe  realistic  wave  patterns. 

Some  of  the  features  that  occur  in  "fcjpical"  waves  in  shallow  water  may 
be  seen  in  Figures  1  and  2.  Figure  1,  taken  from  an  ancient  National 
Geographic  (1933)*  shows  a  very  regular  train  of  one-dimensional  waves  off  the 
coast  of  Panama.  (By  "one-dimensional",  we  mean  that  the  surface  pattern  is 
one-dimensional,  with  virtually  no  variation  along  the  wave  crests.  We  call 
the  waves  in  Figure  2  "two-dimensional",  because  the  surface  pattern  is 
two-dimensional.  In  this  terminology,  there  can  be  no  three-dimensional  waves 
on  the  two-dimensional  water  surface,  even  though  the  velocity  fields  may 
exhibit  vertical  structures.  We  emphasize  that  this  is  only  a  semantic 
convention.) 


♦Presented  at  the  Second  Army  Conference  on  Applied  Mathematics  and  Computing, 
May  22-25,  1984,  Troy,  NY 
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Two  features  of  the  waves  in  Figu  e  1  are  worth  noting.  The  first  is  the 
very  evident  spatial  period  of  the  waves.  The  second  is  that  the  periodic 
wave  pattern  is  far  from  sinusoidal:  the  wave  crests  are  localized  and  rather 
steep,  while  the  troughs  are  very  broad  and  flat.  The  usual  linear  theory  of 
infinitesimal  water  waves  \.e,g..  Stoker,  1957)  predicts  a  sinusoidal  wave 
pattern,  and  the  deviation  of  these  waves  from  a  sinusoidal  shape  is  a  measure 
of  how  nonlinear  the  waves  are.  Thus,  Figure  1  ?  -ggests  that  a  good  model  of 
"typical"  waves  in  shallow  water  should  admit  waves  that  are:  (i)  periodic, 
and  (ii)  nonlinear. 

Figure  2,  oaken  by  Mr.  T.  Toedtemeier  off  the  coast  of  Oregon,  shows  an 
oblique  interaction  of  two  waves  in  shallow  water.  As  in  Figure  1,  each  wave 
has  a  sharp,  localized  crest  and  a  broad,  flat  •#*.  Moreover,  each  of  the 
interacting  waves  is  part  of  a  periodic  wave  trax.i,  but  their  wavelengths  are 
so  long,  relative  to  the  local  water  depth  (about  three  feet,  according  to  tha 
photographer),  that  they  behave  like  interacting  solitary  waves.  The  dominant 
effect  of  the  interaction  on  each  of  the  two  wave  crests  is  that  each  crest 
experiences  a  phase  shift  as  a  result  of  the  interaction.  The  localized 
crests  indicate  that  each  (periodic)  wave  is  individually  'nonlinear;  the 
phase  shift  indicates  that  the  interaction  is  also  nonlinear. 

The  waves  in  Figure  1  happen  to  be  one-dimensional,  but  those  in  Figure  2 
clearly  are  not.  In  general,  one  would  expect  "typical"  waves  on  the 
two-dimensional  water  surface  to  be  two-cimensional. 

Thus,  we  seek  a  model  of  waves  in  shallow  water  that  are:  (i)  periodic; 
(ii)  nonlinear;  and  (iii)  two-dimensional.  In  fact,  we  want  the  simplest 
model  possible  that  has  these  three  properties.  The  main  conclusion  of  this 
paper  is  that  such  a  model  now  exists.  In'  this  summary  we  exhibit  some 
consequences  of  the  model.  Full  details  will  be  published  elsewhere  (Segur  & 
Finkel,  1984). 


The  Kadomtsev-Petviashvili  (KP;  1970)  equation. 
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is  a  scaled,  dimensionless  equation  that  describes  the  evolution  of  long  water 
waves  of  moderate  amplitude  as  they  propagate  primarily  in  one  direction  in 
shallow  water  of  uniform  depth,  without  dissipation.  Mathematically,  the  KP 
equation  generalizes  the  Korteweg-deVries  (KdV)  equation, 


ut  ♦  6uux  +  uxxx 


0. 


(2) 


Physically,  the  derivation  of  (1)  is  very  similar  to  that  of  (2),  except  that 
the  waves  in  (2)  are  required  to  be  strictly  one-dimensional,  while  those  in 
(1)  may  be  weakly  two-dimensional  (see  Ablowitz  &  Segur,  1979  for  the 
derivation). 
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The  KP  equation  also  generalizes  the  KdV  equation  in  the  sense  that  both 
are  completely  integrable.  In  particular,  Satsuma  (1976)  snowed  that  the  KP 
equation  admits  a  two-soliton  solution  of  the  form, 


u  (x,y,t)  »  23x  In  F, 


where 


F  -  1  +  exp  (n i )  +  exp  (ru)  exp  (m  +  ru  +  A)  , 
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A  particular  two-soliton  solution  is  shown  in  Figure  3*  The  qualitative 
agreement  between  the  wave  patterns  in  Figures  2  and  3  is  clear.  Whether 
there  is  also  quantitative  agreement  requires  more  detailed  information  about 
the  ocean  wave  than  is  available. 

This  2-soliton  solution  of  the  KP  equation  is  nonlinear  and 
two-dimensional,  but  it  is  not  periodic.  To  achieve  the  desired  model,  we 
must  generalize  the  2-soliton  solutions  of  the  KP  equation  to  include  periodic 
waves. 

Krichever  (1976)  showed  that  the  KP  equation  admits  periodic  and 
quasi-per iodic  solutions  in  the  form 


u  (x,y,t)  *  ?3X  ®  ($i»  •••• 


where  0  is  a  Riemann  theta  function  of  genus  N.  In  the  simplest  case,  where 
N  -  1, 

♦  =  u  (x  +  py  -  ct)  ,  (5) 

0(<f>)  *  ^  exp  bn  +  i  n  <f>j  ,  Re  (b)  <0  , 

(4)  yields  the  usual  cnoidal  wave  solution  that  is  familiar  from  KdV  theory 
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(e.g.,  see  Sarpkaya  A  Isaacson,  1981).  Figure  4  shows  a  typical  cnoidal  wave 
solution  of  (1);  it  bea-s  a  clear  resemblance  to  -tie  wave  shown  in  Figure  1. 
Moreover,  cnoidal  waves  from  KdV-theory  have  been  validated  in  extensive 
experimental  tests  as  accurate  models  of  one-dimensional  waves  in  shallow 
water. 


Cnoidal  waves  are  periodic  and  nonlinear,  but  they  are  not 
two-dimensional.  To  achieve  the  desired  model,  we  must  generalize  the  cnoidal 
wave  solutions  of  the  KP  equation  to  two  dimensions. 

The  required  generalization  comes  by  taking  N  *  2  in  (4),  and  using 
certain  results  of  Dubrovin  (1981).  This  family  of  exact  solutions  are  called 
KP  solutions  of  genus  2\  they  have  eight  free  parameters.  There  is  a  sense 
in  which  they  represent  an  oblique  and  nonlinear  superposition  of  two  trains 
of  cnoidal  waves.  In  o-e  limit,  these  solutions  resemble  a  slightly  modulated 
cnoidal  wave,  as  in  Figure  5a.  In  another  limit,  they  become  periodic 
generalizations  of  the  2-soliton  solution,  as  in  Figure  5b.  However,  they 
also  represent  waves  outside  of  either  limit,  as  in  Figure  5c. 

These  figures  demonstrate  that  it  is  possible  to 'construct  a  variety  of 
wave  forms  from  the  KP  solutions  of  genus  2.  By  itself,  this  possibility  does 
not  qualify  this  family  of  solutions  as  a  physical  model  of  water  waves.  We 
must  also  give  an  explicit  algorithm  to  specify  every  free  parameter  of  the 
solution  in  terras  of  measured  physical  quantities.  One  such  algorithm  was 
given  by  Sdgur,  Finkel  A  Philander  (1983).  A  different  algorithm,  based  on  a 
different  kind  of  data,  is  given  by  Segur  &  Finkei  (1984).  Once  this 
algorithm  is  given,  the  model  can  be  tested  experimentally,  and  we  invite 
interested  experimentalists  to  do  so. 

This  work  was  supported  in  part  by  tl.  Army  Research  Office,  by  the 
Office  cf  Naval  Research  and  by  NSF  Grant  MCS  '08814(A01).  We  are  grateful 
to  Terry  Toedtemeier  for  permission  to  use  Figure  2. 
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An  exact  KP  solution  of  genus  2  showing  a  "typical"  pattern  of  periodic  waves 
of  finite  amplitude  in  shallow  water. 
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ABSTRACT 

In  investigating  the  scattering  response  of  a  periodic  surface,  the 
use  of  incomplete  or  inappropriate  basis  functions  for  representing  the 
field  (s)  induced  on  the  scattering  surface  has  given  rise  to  what  is  now 
called  the  Rayleigh  Hypothesis  (RH) .  Here  we  use  normalised  Fourier  bases 
for  tnis  purpose  and  develop  a  T-natrix  which  completely  characterises 
the  scalar  scattering  response  of  such  a  surface.  The  Rayleigh  limits  are 
effectively  bypassed,  and  the  obtained  solutions  are  seen  to  obey  unitar- 
ity  as  well  as  reciprocity  constraints.  We  also  show  that  the  measurement 
of  the  scattered  field  can  lead  to  two  different  interpretations  of  the 
nature  of  the  scattering  surface  in  inverse  shape  problems. 


INTRODUCTION 


The  scattering  of  waves  —  be  they  acoustic,  electromagnetic,  or 
elastic,  —  by  periodic  surfaces  lias  been  the  subject  of  much  investigation 
ever  since  Rayleigh  studied  the  scattering  response  of  sinusoidal  reflection 
gratings  [1].  He  expanded  the  incident  and  the  scattered  fields  in  terms 
of  relevant  incoming  and  outgoing  planewaves,  repsectively,  and  these 
decompositions  are  used  to  this  day  in  such  problems.  However,  he  expressed 
the  field (s)  generated  on  the  periodic  surface  in  terms  of  outgoing  plane* 
waves  alone,  a  premise,  —  now  called  the  Rayleigh  Hypothesis  (RH) ,  — 
which  involves  an  incomplete  basis  set,  and  can,  therefore,  be  used  for 
shallow  corrugations.  In  a  classic  paper,  Millar  [2]  has  shown  that 
for  2-D  scalar  problems  involving  a  surface  S2 :  x3  (x2  ,x  )  =  h  cos  (2-rrXj/L) 
the  RH  is  applicable  for  h/L  £  0.072.  For  the  corresponding  3--D  problems, 
we  believe  that  Goodman’s  estimate  [3]  of  h/L  S  0.0504  is  correct,  the 
surface  S3:  x,  (x2,x1)  =  h  [cos  (2ttx2/L)  +  cos  ^ttXj/L)]. 

Since  then  several  efforts  have  been  made  to  bypass  the  above-mentioned 
Rayleigh  limits  on  the  maximum  gradient  of  periodic  surfaces .  Most  of  these 
methods  fall  into  two  categories.  Methods  of  the  first  kind  involve  the 
solution  of  an  integral  equation  (IE)  [4,5];  while  the  second  type  are 
essentially  matrix  procedures  [3,  5-12].  Though  the  IE  methods  have  been 
very  successful  in  dealing  with  highly  corrugated  S2,  their  use  for  S3  is 
extremely  cumbersome  because  of  tedious  oemputations.  Hence,  matrix  methods 
offer  the  or.ly  choice  for  3-D  problems.  In  this  connection,  DeSanto 
[6]  has  formulated  coupled  integral  equations  which  are  converted  into 
matrix  equations  using  relevant  expansions  for  the  fields  of  interest.  A 
more  elegant  approach  is  due  to  Waterman  [12j  who  used  the  'extinction' 
theorem  to  formulate  a  T-matrix  which  characterizes  the  scalar  scattering 
response  of  periodic  S2.  This  method,  known  as  the  T-matrix  procedure, 
involves  an  understanding  of  the  scattering  problem  from  first  principles 
using  the  Huyghen's  and  the  love's  equivalence  principles.  Recently, 
this  approach  has  also  been  extended  to  elastic  scattering  problems  as  well 
[8,9].  ‘ 

Nevertheless,  the  expansion  of  the  surface  field (s)  in  terms  of  only  the 
incoming  planewave  bases  for  the  T-matrix  approach  has  proved  to  be  a  stumbl¬ 
ing  block  in  its  application  for  highly  corrugated  surfaces.  Such  an  expans¬ 
ion  is  as  incomplete  as  the  one  used  by  Rayleigh;  consequently,  this  method 
has  suffered  from  the  same  limitations.  Recently,  however,  using  a  hybrid 
T-matrix  -  point-matching  technique,  wherein  the  surface  field (s)  is  express¬ 
ed  in  terms  of  both  incoming  and  outgoing  planewave  bases,  the  applicability 
of  the  method  has  been  increased  to  higher  corrugations  than  previously  possi¬ 
ble.  We  have  used  this  hybrid  technique  for  scalar  [13]  as  well  as  elastic 
[9]  scattering  problems  involving  S2  surfaces. 

Specifically  for  scalar  problems ,  Four:.  'oases  have  been  used  for 
representing  the  surface  field  (s)  in  the  T-matrix  framework  [3]  and  with 
success  as  evinced  by  the  data  published  in  [14].  On  the  other  hand,  using 
these  same  Fourier  bases  for  computing  elastic  responses  by  Chuang  and  John¬ 
son  [8]  has  not  lifted  the  T-matrix  approach  from  within  the  Rayleigh  limit 


L  L 
2  ,  2 


a“  =  -  dx  dx 
pq  )  1  )  2 

0  0 


(V2ikL  La Ypq)  • 


•X  •  {exp(-ik.x)  Vjp(x)}  ,  (13) 

* 

the  vector 

X  =  (-F  »-F  /l). 

^  1  2 

In  order  to  solve  the  problem  all  we  need  now  is  the  surface  field  represent¬ 
ation,  which  we  assume  to  be  [6,15] 
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finally,  substitution  of  a  truncated  (14)  in  (13)  leads  io  the  matrix 
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At  this  point  we  remark  that  replacing  the  k _ by  either  of  the  k _  vectors 
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would  ccnpletely  debilitate  the  T-matrix  procedure  and  subject  it  to  the 
Rayleigh  limits,  in  its  ability  to  handle  deeply  corrugated  surfaces. 

Likewise,  for  the  case  H  the  boundary  condition  (4)  is  substituted  in  (6) 
as  also  the  assumed  surface  field  expansion 
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NUMERICAL  RESULTS 


The  system  of  equations  (16)  and  (19)  were  programmed  on  a  EEC  vax  11/ 
730  mimoorputer.  The  inversion  of  matrices  involved  was  carried  out  using, 
a  LCJ  de<^onposition  technique  [16]  via  an  IMSL  subroutine  LECTlC,  our  numerical 
proced'jre  being  implemented  in  double  precision  arithmetic.  The  Q  matrices, 
themselves,  ware  computed  using  a  two-dimensional  Gauss-Leqendre  quadrature 
scheme  [17],  although  for  special  cases  cf  boundary  profiles  these  matrices 
can  be  evaluated  in  closed  forms.  Convergence  of  the  solution  was  checked  by 
ensuring  that  the  scattered  field  coefficients  converged  to  within  0.5%.  An 
additional  check  was  also  provided  by  (22)  which  had  to  be  satisfied  to  be 
within  ±0.005  of  unity. 

The  general  theory  presented  in  the  previous  section  holds  for  both  S2 
and  S3  surfaces  having  periodic  boundary  profiles.  However,  here  we  consider 
surfaces  described  by  . .  (x  ,x  )  =  f(x  )  +  f  (x  ),  f(x)  =  h  oos(2irx/L) ;  for  S2, 

3  2  12  1 

f  (x  )  is  set  to  zero.  The  boundary  conditions  prevailing  on  the  surface  can 
be  either  Dirichlet  or  Neumann. 

Consider,  first,  Fig.l  where  we  have  plotted  the  scattered  powers  for 
a  S2 ,  and  have  compared  our  calculations  with  those  of  Hoi ford  1 5 J .  As  is 
clear  from  this  figure,  the  improved  T-matrix  scheme  is  applicable  for  much 
higher  values  of  che  parameter  h/L  than  the  Rayleiu--  limit  of  0.072. 


1.  Reflected  mode  power  P  computed  using  the  presented 

approach  for  a  sinusoidal  S  ,when  a  planewave  is  incident 
at  0O  =  15°,  <p0  *  0°;  kL  *»  4tt.  Ref.  5  are  the  IE  results, 
(a)  Dirichlet  b.c.,  (b)  Neumann  b . c . 


Similarly,  in  fable  I  vra  consider  a  doubly  sinusoidal  S3  surface,  for  which 
case  scattering  is  observed  in  9  separate  directions.  Again,  note  that  h/L 
=  0.15,  which  is  roughly  three  times  higher  than  Goodman's  conjecture  of 
0.0504  for  the  Rayleigh  limit.  In  these  calculations,  as  in  others  made  by  us, 
we  have  been  careful  to  tolerate  only  a  0.5%  error  in  the  check  for  the  conser 
vation  of  energy,  and  this  seems  to  serve  adequately  as  a  check  on  the  conver¬ 
gence  of  the  scattered  field  coefficients  as  well.  Reciprocity  of  the  scatter¬ 
ing  solution  has  also  been  confirmed  as  is  shown  by  oonparing  the  data  in 


Table  I.  Scattering  of  a  normally  incident  planewave  from  S3; 

h  =  0.426,  L  =  2.84,  k  =  3.5.  Each  entry  represents 
?  of  (21). 


±P»  ±9 


Dirichlet  b.c. 


Neumann  b.c 


0.11534 

0.52718 

0.03792 

0.04348 

0.18407 

0.07574 

1.0033 


1.0041 


V 


Tables  II  and  III.  First,  from  Table  II  we  see  that  the  scattered  power  in 

the  direction  8  =  62.2461%  <j>  =  0°  is  0.13240  for  the  case  S  and  0.21042 

for  the  case  H,  when  a  planewave  is  incident  normally  on  S3 .  Next,  in 

Table  HI,  the  exciting  planewave  is  incident  at  8  =  62.2461°,  <*>  =0°. 

0  0 

For  this  latter  excitation,  the  scattered  power  in  the  normal  direction 
was  ccrputed  to  be  0.13231  for  the  Dirichlet  b.c.  and  0.21063  for  the  Neu¬ 
mann  b.c. ,  thus  demonstrating  the  satisfaction  of  the  reciprocity  constraints. 


Table  II.  Same  as  Table  I  except  h  =  0.284,  L  *  2.84, 
and  k  =  2.5. 


±p,  ±q 

Case  S 

Case  H 

0,0 

0.47167 

0.15755 

1,0) 

0,1/ 

0.13240 

0.21042 

E  1.0012  0.9992 


Table  III 

.  Same  as  Table  II 
incident  from  8 

0 

except  the  incident  wave  is 

*  62.2461°,  <f>  -  0°. 

0 

p.  q 

Case  S 

Case  H 

0,0 

0.83265 

0.43256 

-1,0 

0.13231 

0.21063 

-1,±1 

0.01476 

0.11957 

-2,0 

0.00818 

0.11768 

E 

1.0027 

1.0000 

A  NGN-UNIQUENESS  OF  THE  INVERSE  SHAPE  PROBIEM 

As  has  been  seen  in  the  preceding  sections,  a  periodic  surface  scatters 
an  incident  planewave  in  discrete  well-defined  directions.  Seme  of  these 
directions,  for  which  y  is  real,  have  scattered  planewaves  which  go  i^Jto 

z  =  00 .  Others,  for  which,  y  is  imaginary,  represent  evanescent  planewaves. 

In  the  far  zone,  the  reflection  coefficients  a+  can  be  measured  for  the 

pq 

propagating  planewaves;  hence  the  reflected  field  can  be  obtained  f.xm  measure¬ 
ments  as 


(23) 


ip  (x  ,x  )  -  Z  a+  expdkr  oos(0  -0)};  kx  large 

3  3  1  p  p«  p  3 

where  we  have  considered,  for  the  sake  of  brevity,  only  the  2-D  problem; 

=  arctan(a  .Y_,>) ;  9  =  arctardx  ,  x  ) ;  and  r  is  the  radial  distance  from  the 
p  P  pQ  13 

origin  to  the  field  point.  The  aunmation  holds  only  for  the  propagating  plane- 
waves.  However,  in  most  situations,  the  scattering  surface  is  finite  of  total 
expanse  B.  Hence,  if  the  surface  were  to  be  illuminated  by  a  finite-aperture 
field  .  f 


|A(x)  -  |exp(ik£.x) ,  x  e  S2,  -B/2  <_  x  <_  B/2 
i  0  ,  x  e  S2,  jx|>  B/2 


then,  for  a  sufficiently  usual  case  when  the  Rayleigh-Wood  anomalies  are 
absent,  the  scattered  field  has  been  given  by  Jordan  and  Lang  [7]  to  be 

ip  (x)  =  kB  (2rrkr)  **  exp[i(kr-*rr/4)  ]  Z  a*  sinc(kB(sin0  -s.in0)/2] .  (25) 

s  p  pu  p 

Ebr  (25)  to  hold,  kB  trust  be  large;  and  only  the  propagating  plane  waves  need 
be  accounted  for. 

Consider,  next,  a  flat  surface  of  the  same  expanse  B  which  is  illuminat¬ 
ed  also  by  the  field  (24) .  This  flat  surface  has  a  periodic  reflectivity 
profile  p  of  period  L,  the  reflectivity  function  being*  dependent  on  the  fre¬ 
quency.  The  scattered  field  field  can  be  easily  set  down  as 

Js B 


j*(x  ,  x  )  dx'  p(x')  exp  (ilex' sin0  )• 

3  11  1110 


•  exp[ik(x2  +  (x  -x')2}35]  (x2  +  (x  -x')2}-35  (26) 

3  11  3  11 

where  0  =  arcs  in  (a  ) .  Because  of  the  periodic  nature  of  p,  this  can  be  red- 
0  0 

uced  to 


p(x)  =  Z  o  exp  (ip2ra/L) 

P  N 

p1  (x  ,x  )  =  Z  p  Z  exp  (-ik£Lsin8  ) 
31  p  p  £=-N  0 


/dx' 
-kL 


(27a) 


(x2  +  (x  -x'+£L)2}  3s* 


•exp(ikx'  sin0  )exp[ik(x2  +  (x -x’+£L)  2},s] , 

1  P  3  11 


(27b) 


and  which,  by  approximating, 

(x2+  (x  -  x'+fcL)2]5*  =  [x2  +  (x  +£L)2]3$  - 

311  31  ,  -lc 

-  x'  (x  +£L)  [x2+  (x  +£L) 2  ]  % 

11  3  1 

further  reduces  to 

N  ,  , 

ii»1(x  ,x  )  =  E  pn  Z  exp (-ik£Lsin8  )exp  ik(x2+  (x  +£L)2F  r~^* 
3  1  p  P  fc»-N  0  31 


(27c) 


•sinc{(kI/2)  (sin0  -  (x  +£L){xz+(x  +£L)2]  **) } 

pi  3  1 


.  _  Tt 


_  .  i..-.  CTf  ITi  l.  IT*  n i-W  i-'SX 


.T.L.-.  .-s  v^v.^iv.v.rauwJ.' 
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SOLITARY,  PERIODIC  AND  CHAOTIC  WAVES  IN  THIN  FIIMS 
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ABSTRACT .  A  three-dimensional  phase  space  analysis  is  carried  out  for  a 


parabolic  fourth  order  partial  differential  equation.  This  equation  describes 


the  time  evolution  of  a  class  of  nonlinear  waves  observed  in  viscous  liquid 


films,  diffusion  flames  and  certain  chemical  oscillations.  Solitary  waves,  almost 


periodic  waves,  as  well  as  locally  periodic  but  globally  chaotic  waves  are  found 


numerically. 


I.  IfflRCDUCTEON.  Lin  [1]  found  that  the  supercritieally  stable  monochromatic 


waves  [2-4]  in  a  liquid  film  flowing  down  an  inclined  plane  were  stable  only  with 


respect  to  side-band  disturbances  of  small  band-widths.  With  respec  :  to  distur¬ 


bances  of  arbitrary  band-widths,  the  waves  were  unstable  due  to  modal  .interactions 


(51.  For  the  nonlinear  wave  evolution  with  strong  modal  interacrtion,  he  obtained 


th-r  following  evolution  equation: 


n-2  3 f  s 


+  jc  (R-R  )a~l  |-4  +  aW  JL£  +  e  +  F  9  X  +  —  = 

15  1  3  3  3v*  3vd  avi> 


where  a  =  (2i\ho/L) ,  ho  and  L  being  respectively  the  average  film  thickness  and 


the  wave  length,  is  a  snail  parameter,  n  is  a  cxnstant  to  be  chosen  for  proper 


time  scale  of  evolution,  f,  T  and  x  are  suitably  normalized  wave  amplitude,  time 


and  distance  in  the  direction  of  wave  propagation  respectively,  and  e,  F  and  r1 
are  parameters  depending  on  the  Reynolds  nurrber  R,  the  Weber  nunber  W,  and  the 


angle  8  between  the  incline  and  the  horizontal,  i.e. 


32  „2 


E  =  ~  R  +  2  -  R  Cot  8  ,  F  =  —  or  RW  +  rf-  a  *RK  , 


40  2 

—  -  . ...  /v4. 


r.  2  DM 


5  2 

J?2  =  ^  (Cot  B  -  a  W)  , 


g  h  sin  8 
o 


pgh  sin  3 
o 


*  *  '  1  *  •* 


a  ,-v 


in  which  9  is  the  gravitational  acceleration,  v  the  kinematic  viscosit}'  and 
a  is  the  surface  tension.  Eq.  (1)  is  an  appropriate  model  equation  in  the  flew 
parameter  range  such  that 


R  -  r2  =  0(a)  , 


aw  =  0(1) . 


The  appropriate  time  scale  in  this  range  is  n=2.  For  a  much  larger  surface 
tension  such  that  a 2w  =  0(1) ,  we  put  H  =  af  into  Eq.  (1)  and  obtain 

(2) 


8 


H  +  ■—  (R-R.)H  +  f-  a2WH  +  4HH  =  0 

T  15  1  xx  3  xxxx  x 


where  terms  of  0(a)  have  been  neglected  and  n=l  for  the  appropriate  time  scale. 

Eq.  (2)  can  be  suitably  normalized  to  read 

Gt  +  G+  G  +  G  =  0  ,  (3) 

where  G  is  the  wave  amplitude,  and  t  and  s  are  respectively  the  time  and  space 
variables.  Eq.  (3)  is  a  model  equation  expected  to  be  asymptotically  valid  in 
the  range  r-r^  =  0(1) ,  a 2w  =  0(1) .  Eq.  (3)  has  also  been  independently  obtained 
by  Atherton  and  Hansy  [6,12] ,  Nepcmnyashchii  [7] ,  and  Sivashinski  and  Michelson 
[8]  in  the  context  of  film  waves.  Bq.  (3)  was  also  found  to  describe  the  nonlinear 
evolution  of  diffusion  flame  instability  [9],  certain  chemical  oscillation  [10] 
and  the  Ravleigh-Tbylor  instability  in  a  top-heavy  two-layered  film  [11] .  Thus, 
despite  its  limited  range  of  applicability,  it  does  have  some  general  dynamic 
properties.  The  cormon  feature  the  different  problems  governed  by  Bq.  (3)  have 
is  that  all  of  these  problems  contain  a  nonlinear  driving  force,  a  diffusion 
process,  and  a  restoring  force,  but  no  dispersion  effect.  The  dispersion  effect 
is  represented  by  the  third  derivative  term  in  Eq.  (l).  We  point  out  that  both 
equations  (1)  and  (3)  were  derived  in  a  reference  frame  moving  with  the  Kinematic 
wave  velocity  which  is  three  tines  the  average  fluid  velocity  in  the  film. 

^umer-'-ca^  solutions  of  Eq.  (3)  were  obtained  by  Atherton  and  Hcmsy  [12]  with 
a  modified  algorithm  used  by  Thppert  [13]  for  the  solution  of  the  Kortweg-de  Vries 


equation.  Trio y  found  that  tlie  consequence  of  nonlinear  evolution  was  a 
stationary  periodic  wave  which  is  insensitive  to  the  initial  data.  The  initial 
data  used  by  them  included  large  amplitude  sine  waves.  In  addition  to  periodic 
waves  Tsvelodub  [14]  and  others  [ 15 ,16, IS J  also  found  stationary  soli ton  solu¬ 
tions  of  Eq.  (3).  However,  Sivashinsky  and  Michelson  [8]  and  Kuramoto  [10] 
reported  recently  that  the  numerical  solution  of  Eq.  (3)  with  large  amplitude 
sine  waves  as  th.  nitial  data  resulted  in  diaotic  waves.  There  seems  to  be  a 
puzzling  inconsistency  among  existing  works.  'Ihe  purpose  of  this  work  is  to 
resolve  this  puzzle  und  to  gain  a  deeper  understanding  of  the  physics  of  the 
•s'-xilinear  evolution  of  a  class  of  problem  modeled  by  Eq.  (3) . 

First  ve  seek  a  static nary  solution  of  Eq.  (3),  and  cast  it  into  a 
-  ■  .deal  sy  sic  •»-  Then  we  determine  the  properties  of  its  fixed  points.  Guided 
by  i  'aese  pr  .  »»*  ’  ±es  we  obtain  the  integral  curves  and  study  their  structure  by 
portraying  them  on  the  Poincare  sections  and  phase  planes.  From  these  phase 
portraits,  we  seek  the  conditions  which  may  lead  to  periodic,  solitary  or  chaotic 
wave  patterns. 

II .  WAVES  IN  PHASE  SPACE.  Consider  stationary  waves  travelling,  at  a 

constant  speed  m  in  the  (c.-r)  -space.  By  use  of  the  Ga lli lean  transformation 
j-  =  q  _  Mt,  Eq.  (3)  can  be  written  as 

-M3'  +  03'  +  G"  +  G'"'  =  0,  M 

where  primes  denote  differention  with  respect  to  5 .  Ths  first  integration  of 
(4)  gives 

~(M-G/2)G  +  G'  +  G"‘  -  c  , 

where  c  is  the  integration  constant.  Let  G  =  F-d,m  «  M+d,and  c=(d2/2)+Md  in  this 
equation,  ve  have 

-(m-F/2)F  +  F'  +  F" '  =  0,  (5) 

where  d  is  an  arbitrary  constant  datum  from  which  the  amplitude  of  F  is  measured. 

Eq.  (5)  can  be  cast  in  the  dynamical  system 
F'  =  FI 

Fl'~  F2  (6) 

F2 '  =  ( m-F/2) F-Fl  . 
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as  well  as  the  location  of  the  fixed  point,  which  is  fixed  solely  by  m,  the  wave 
form  corresponding  to  this  trajectory  will  also  heavily  depend  on  the  particular 
values  of  m.  The  fourth  order  Runge-Kutta  method  has  been  used  to  integrate  Eg. 

(6) .  The  numerical  results  are  given  in  the  next  section. 

III.  NUMERICAL  RESULTS.  The  projection  of  the  integral  curve  of  Eg.  (6) 
on  the  f-fi  plan  is  plotted  in  Fig.  la,  for  the  case  of  m  =  .12161,  F(0)  =  .00125, 
Fl  ( 0)  =  .01  F2  ( 0)  =  .004.  The  trajectory  follows  initially  the  one-dimensional 
unstable  manifold  near  SI  (0,  0,  0)  before  it  is  attracted  into  the  one¬ 
dimensional  stable  manifold  near  S2  (2m,  0,  0) .  However,  the  transition  is 

marked  by  several  successive  kinks,  indicating  the  intersections  of  the  one- 

\ 

dimensional  unstable  manifolds  with  the  two-dimensional  stable  spiral  manifold 
near  si  and  with  the  two-dimensional  unstable  manifold  near  S2.  The  trajectory 
then  spirals  cutward  from  S2  along  its  two-dimensional  unstable  manifold  before 
altering  into  the  two-dimensional  stable  manifold  of  si.  Then,  it  ccmes  to  a 
point  very  close  to  the  initial  point  (.00125,  .01,  .004)  when  £  =  175.  Further 
integration  produced  an  extension  of  the  trajectory  which  almost  duplicates  its 
predecessor  fran  £=0  to  175.  The  corresponding  wave  profile  is  plotted  in  Fig.  lb. 
The  intersection  of  the  trajectory  with  the  plane  F2  =  o,  i.e.  a  Poincare  section, 
is  given  in  Fig.  lc  for  the  first  eight  successive  wave  trains.  This  figure 
reveals  clearly  the  nature  of  an  almost  periodic  wave.  Slightly  different 
initial  points  lead  to  slightly  altered  trajectories.  It  appears  that  the  pseudo 
periodic  waves  are  structurally  stable  with  respect  to  small  disturbances.  Among 
these  trajectories,  for  a  given  m,  there  appears  to  be  only  one  exactly  periodic 
wave.  Thus,  it  is  infinitely  more  likely  to  find  almost  periodic  waves  than  the 
exactly  periodic  cne.  Almost  periodic  waves  have  also  been  fcund  for  larger 
values  of  m.  Fig.  2  gives  such  an  example  for  m=4.  Note  that  the  larger  amplitude 
almost  periodic  waves  tend  to  appear  more  chaotic  over  a  distance  much  larger  than 
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the  typical  local  peak  to  peak  distance  but  remain  locally  almost  periodic. 
Similar  waves  have  also  teen  found  for  negative  values  of  m. 

Fig.  3a  gives  the  soliton  with  m  =  1.21599.  The  corresponding  hcmoclinic 
trajectory  is  projected  on  the  F-Fl  plane  in  Fig.  3b.  This  soliton  is  identical 
to  that  given  in  Fig,  1  of  Tsvelodub  except  that  his  amplitude  H  was  scaled 
differently  from  our  F.  Tsvelodub  obtained  the  soliton  by  representing  the 
solution  as  a  Fourier  integral  and  by  solving  the  resulting  integral  equation 
interatively.  ihis  soliton  is  characterized  by  a  large  elevated  peak  following 
a  small  amplitude  wave  train  ahead  of  it.  The  second  soliton  with  a  deep  trough 
trailed  by  small  amplitude  oscillations  travelling  at  m  =  -1.21599  also  exists, 
since  Eq.  (6)  remains  invariant  upon  changing  the  sign  of  £,  F  and  m. 

The  projection  of  a  trajectory  onto  the  F-Fl  plane  is  given  in  Fig.  4a  for 
the  case  of  m=0.  The  corresponding  wave  profile  is  given  in  Fig.  4b.  The 
intersection  of  the  trajectory  with  the  plane  F2  =  0,  i.e.  the  Poincare  section, 
is  given  by  Fig.  4c.  It  is  seen  from  this  figure  that  small  finite  amplitude 
periodic  waves  which  travel  exactly  at  the  Kinematic  waves  speed  dc  exist  but 
they  are  neutrally  unstable. 

IV.  DISCUSSION.  The  phase  space  study  of  Eq.  (6)  shewed  that  the  two 
types  of  soli tons  found  by  Tsvelodub  [14]  and  Shkadov  [15]  do  exist.  It  is 
very  unlikely  that  these  so]  i tens  are  endowed  with  the  stability  properties  of 
the  K-dv  [16]  solitons.  Stationary  periodic  wave  solutions  [14,17]  of  Eq.  (3) 
also  exist.  However,  these  pure  waves  are  infinitely  fewer  in  numbers  than  other 
forms.  These  forms  include  those  which  are  almost  periodic  locally  over  a  few 
“wave  length"  but  appear  chaotic  over  a  much  2arger  distance.  Our  findings  that 
most  of  the  stationary  waves  appear  locally  periodic  but  globally  randan  may 
explain  why  Atherton  and  Hcmsy  were  able  to  obtain  a  purely  periodic  stationary 
wave  of  relatively  short  wave  from  Eq.  (3)  with  an  initial  sinus iodal  wave  of  a 
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snail  length,  i.e.  F  =  .1  sin£.  Our  results  may  also  explain  why  the  initial 
sinusoidal  wave  of  a  much  larger  length  i.e.  F  =  sin (n^/ 100)  used  by  Sivashinsky 
and  Michelson  resulted  in  locally  periodic  but  globally  chaotic  waves.  CXir 
results  are  also  consistent  v/ith  the  finding  of  Pumir  [19]  et  al.  that  Eq.  (3), 
when  discretized,  possesses  positive  characteristic  exponents  only  when  the 
wave  lengtns  are  sufficiently  large,  '’he  apparent  inconsistencies  in  the  exist¬ 
ing  wor.'.s  are  new  explained.  It  is  almost  clear  that  the  ultimate  forms  of  the 
equilibrated  waves  are  very  sensitive  to  the  initial  condition,  Fcwever,  the 
precise  initial  conditions  which  must  be  net  in  order  to  produce  specific  types 
of  stationary  waves  of  Eq.  (3)  remain  unknown. 

It  should  be  pointed  out  that  not  all  of  the  predicted  chaotic  structure 
can  be  expected  to  be  observable  in  experiments,  especially  those  with  large 
amplitude.  Sane  of  the  chaotic  - tructure  may  s.inply  be  the  consequence  of  the 
neglect  of  the  physical  factors  assumed  to  be  insignificant  in  Eq.  (3) .  For 
example,  the  neglected  three  dimensional  stabilizing  surface  tension  effects  [20] 
may  alter  the  appearance  of  chaos  dramatically.  Chly  qualitative  caparisons 
between  theories  and  experiments  are  mentioned  in  the  cited  works. 
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discontinuous  dependence  of  solution  on  boundary  conditions 

FOR  LARGE  AMPLITUDE  SHOCI  UAFES1 
T.  C.  T.  Ting 

Department  of  Civil  Engineering, 

Mechanics  and  Metallurgy 
University  ot  Illinois  at  Chicago 
P.O.  Box  4348 
Chicago#  Illinois  60680 

ABSTRACT.  Plane  waves  of  finite  amplitude  in  a  simple  elastic  solid 
which  occupies  the  half  space  i  );  0  are  studied.  For  isotropic  materials 
there  are  lwo  plane  polarized  simple  raves  as  well  as  two  shock  waves  and  one 
circularly  polarized  simple  wave  which  can  be  regarded  as  a  shock  wave.  Using 
the  wave  curves  and  shock  curves  in  the  stress  space,  one  can  see  clearly  what 
combination  ot  simple  waves  and/or  shock  waves  is  needed  to  satisfy  tne 
initial  and  ooundary  conditions.  For  hyperelastic  materials,  the  wave  curves 
associated  with  different  wave  speed  are  orthogonal  to  each  other.  Examples 
are  presented  for  second  order  hyperelastic  materials.  In  one  example  we  show 
that  the  solution  requires  as  many  as  four  simple  waves.  In  another,  when  the 
amplitude  of  the  applied  shock  is  large  enough,  the  solution,  though  does  not 
blow  up  with  time,  does  not  depend  continously  on  the  boundary  conditions. 
Mathematically,  this  may  create  difficulties  for  a  numerical  solution  of  the 
problem.  Physically,  this  means  that  if  the  applied  load  at  the  boundary  is 
not  properly  controlled,  any  slight  deviation  in  the  applied  load  would 
result  xn  •  finite  jump  m  the  material  response. 

1.  INTRODUCTION.  Plane  finite  amplitude  waves  in  sisiple  elastic  solids 
have  been  studied  by  many  investigators  I1-7J .  The  deformation  gradient  was 
invariably  used  as  one  ot  the  dependent  variables.  In  [4]  Bland  showed  shat 
there  existed  three  piano  simple  waves  and  three  associated  plane  shock  waves 
in  xsotropic  hyperelastic  materials;  two  of  the  simple  waves,  and  two  of  the 
shock  waves  are  plane  polarized  with  respect  to  the  deformation-gradient  space 
and  the  remaining  simple  wave  and  shock  wave  are  circularly  polirized. 
Davison  [5]  obtained  centered  wave  simple  solutions  to  some  initial  and 
boundary  conditions  by  a  semi-inverse  approach,  i.e.,  one  assumed  a  certain 
combination  ot  simple  waves  and/or  shock  waves  to  see  what  reformation 
gradients  should  be  prescribed  as  the  initial  and  ooundary  conditions. 

One  ot  the  main  purposes  of  this  paper  ia  to  present  a  means  of 
determining  thu  correct  combination  of  siaple  waves  and/or  shock  waves  to 
satisfy  tne  prescribed  initial  and  boundary  conditions.  Since  deformation 
gradients  are  seldom  prescribed  as  the  initial  and  ooundary  conditions,  we  nae 
stress  as  the  dependent  variable,  ffe  introduce  the  'stress  paths'  for  simple 
waves  and  'stress  paths'  for  shock  waves  to  find  the  solution.  The  idea  of 
using  the  stress  paths  for  simple  waves  in  solving  the  problem  was  employed  in 
the  study  of  elastic-plastic  wave  propagation  [8,9].  However,  ?iresa  paths 


^This  is  based  on  the  work  in  collaboration  with  Tongchi  Ii.  [12] 
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fox  shock  wave  were  straight  linos  because  the  shock  wave  studied  in  [8,9]  was 
linear  and  involved  only  one  stress  consonant.  The  stress  paths  are  in  fact 
the  projection  of  wave  curves  [1C]  on  t ho  stress-space > 


2.  BASIC  EQUATIONS  FOR  PLANE  WAVES  AND  THEORY  OF  SIMPLE  TAVBS.  In  a 
fixed  rectangular  coordinate  system,  let  x^  and  X^  be,  respectively,  tne 
position  of  a  particle  at  tine  t  and  at  t  =  3  which  is  taken  as  tne 
undefomed  state.  The  siaterial  occupies  the  half  space  X^  £  0.  Assuming 
that  the  plane  wave  is  propagating  in  the  Xj-direction,  we  rave 

*i  “  Xf  +  u^(X,t)  (1) 

wnere  X  «  Xj  and  u^  is  the  displacement.  The  deformation  gradient  F  then 
has  the  expression 


1+Pj  0 


iJ  “3*J 


Pi  »  du^/dX  (3) 

Let  T  be  the  Piola-Iirchhoff  stress  tensor  of  the  xirst  kind.  For  simple 
elastic  materials  T  is  a  given  function  of  F 

T  -  T(F)  (4) 

Since  F  is  a  function  oz  X  and  t,  so  is  T.  The  elutions  of  motion  and 
the  continuity  of  displacement  can  them  be  written  as 


SX  ’  Po~d t  ”  0 

3v,  3Pj 
IX"  ~3t  “  0 

where  p0  is  the  mass  density  in  the  undeformed  state  and 


yi  “  IT 

*i  “  Til  m  ail  (8i 

In  Bq.  (8)  iv  the  Cauchy  stress  which  is  related  to  T^j  by  [11] 

J"1!?1  (9) 

wnere  J  «  ||F||  und  the  superscript  T  stands  for  tne  transpose*  The 
second  equality  of  Eq,  (8)  follows  from  the  special  form  ot  F  given  in  Eq. 
(2>. 
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Noticing  that  s^  is  s  inaction  of  pj#  P2  and  P3 

*i  =  •|(»1.^'*3)  U0) 

and  «ssuaing  chat  its  inverse  as  veil  as  the  derivatives 

Gij  =  3Pi''9*j  (11) 

exist,  ve  write  Eqs.  (5,6)  in  aatrix  notations  as 

s  —  pi  =  0  (12) 

~,X  o~,t 

i,r?»,t  =  0 

where  a  coaaa  stands  tor  the  partial  differentiation.  Introducing  tne 
following  aatrices 


p0?  oi  r  o  -f 

,  B  ** 

!  0  G  -I  0 


?  -I]  pf) 
[-1  o]  ’  J-.5j 


where  I  i»  the  unit  aatrix,  Bq,  (12)  has  the  fora 

♦er  * 

A»  .  +  Bw  T  *  0  (14) 

~-v  #  V  ~-v  I  A 

The  formulation  here  differs  froa  that  ox  [1-7]  in  that  the  dependent 
variables  are  s  and  v,  not  p  and  v.  Notice  that  B  is  a  constsnt 
aatrix  while  k  depends  on  s  only. 

3.  jgCOKP  ORDER  ISOiaOPIC  HYPBBBLA3TIC  MhlBSIhLS.  For  illustrative 
purposes  we  will  consider  isotropio  hyperelastic  materials  of  second  order, 
i.e.,  the  uefermation  gradients  arc  functions  of  ij,  <3  of  order  up  to 
two.  Hence 


be  e  e 
e  =  a  a  +  j-  <7  +Jf1* 


y  »  d  +  eo 


(15a) 


(15b) 


where  a,  b,  d  and  e  are  aaterial  constants  and 


“  *1  *  *2  *  *2  +  *3 


«  *  pi  *  r2  -  4  +  4 


For  linear  aaterials  b  =  e  *=  0  and  hence  a  and  d  can  he  identified  with 
Laae  constants  X  and  n  by 

d  =  1/m  (17) 

a  **  1(A+  2 pi) 


iV'Vv.V.VVjy, 


Since  Lane  constants  are  positive,  ire  have 


d  >  2s  >  0 


(18) 


Eqs.  (14)  yield  three  wave  speeds  cA,  (i  «  1,2,3).  We  assume  that 

Cj  2  Cj  >  f'  (19) 

Sinple  waves  associated  with  c^  and  bj  will  be  sailed,  respectively,  the 
'fast'  and  'slow*  simple  waves.  The  sinple  va^e  at  roc is ted  with  C2 
degenerates  into  a  shockwave. 


Detail  derivations  of  equations  tor  sinple  waves  can  be  found  in  [12] . 
The  stress  paths  zor  the  fast  and  slow  sinple  waves  lie  on  the  (<r,v)  radial 
planes.  They  are  orthogonal  to  each  other.  The  stress  paths  for  cj  -  wave  is 
a  circle  t  =  constant  on  a  v  <=  constant  plane.  For  the  second  order 
hypereiastic  materials  given  by  Eqs.  (15),  the  stress  paths  for  sinple  waves 
and  shock  waves  depend  on  one  uon-dinentional  parameter  k: 


These  are  four  cases  depending  on  the  values  of  k.  The  stress  paths  for 
sinple  waves  for  case  3  (0  (.  k  <.  1)  are  shown  in  Fig.  1.  Solid  lines  are 
stress  paths  for  fast  sinple  waves  and  dotted  line  are  for  alow  simple  waves. 
The  arrows  indicate  the  direction  along  which  the  wave  speed  is  decreasing. 
Thus,  if  the  stress  state  at  t  =  0  is  at  point  B  while  the  boundary 
condition  specified  at  X  -  0  is  the  point  A,  the  adnissable  stress  path  is 
BanTA.  Th*  wave  pattern  in  tne  (X,t)  plane  is  snown  in  Fig.  2.  Ve  see  that 
*a  have  four  simple  waves  in  this  example.  This  is  due  to  the  existence  of 
the  point  0  ir»  Fig.  1  where  the  fast  and  slow  wave  speeds  are  identical. 
This  it  a  singular  point  which  always  provides  some  unexpected  phenonena  [13] . 

4.  PLANE  SHOCK  WAVBS.  If  ve  denote  the  junp  of  a  quantity  f  across  a 
shook  wave  by 

If]  =  f  -  f+  (21) 

wnere  f~  and  f+  are  the  values  of  f  behind  and  ahead  of  the  shockwave,, 
the  jump  conditions  representing  the  conservation  of  momentum  and  the 
ooatinnity  of  displacements  can  ba  written  as  [11] 

[a-]  +  p^Vj]  «  0  j  (22) 

Ivi]  +  PoVCpi3  =  C  ’ 

By  eliminating  [vj]  in  Eqs.  (22)  and  in  making  use  ct  (15),  one  obtains 
equations  for  (or”,T~)  when  (<t+,t+)  and  V  are  known.  For  a  fixed  (a+,r+) . 
(v~.t~)  traces  a  stress  path  for  shock  wave  as  V  varies.  It  can  he  shown 
that  V  ->  c  as  (v+,r+)  ->  (or” ,x~) .  We  will  denote  by 


V}  (or  V3)  the  shock  wave  speed  associated  with  points  on  the  stress  path  for 
shock  wave  which  reduces  to  c^  (or  c3)  when  (<t“,t~)  approaches  (<r+,t+) . 

It  snould  be  noted  that  not  every  point  on  the  stress  path  for  shock  wave 
is  an  admissible  shock.  According  to  Lax  [14],  a  shock  wave  is  stable  if 

cT  >  v  1  ej  ,  (i  =  1,2,3)  (23a) 

Otnerwise,  the  snock  wave  may  develop  into  a  simple  wave.  We  may  also  write 
Eq.  (23a)  as 

C^\<r~',X~)  2  V^(<r~,X~  ;  u+,T+)  2  Cj(tf+,T+)  (23b) 

5.  SOLUTION  WHICH  INVOLVES  SHOCK  WAVES.  For  illustrative  purposes  we 
consider  the  following  initial  and  boundary  conditions 

(<r.x)  =  (0,0)  ,  at  t  =  0  (24) 

=  (<r*,T4)  ,  at  1  =  0 


Depending  on  the  location  of  (<t*,t*)  in  the  ( <r,x )  plane,  the  solution  may 
consist  of  a  different  combination  ot  simple  waves  and/or  shock  waves.  We 
again  use  case  3  as  an  example.  With  the  initial  value  being  xero  and  the 
boundary  value  prescribed  as  (<ra,ra),  the  solution  depends  on  the  position  of 
(<ra,xa)  in  the  (<r,t)  plane.  Fig.  3  (see  also  Fig.  1).  If  (aa,r*)  is  the  point 
Aj  in  region  1  bounded  by  the  positive  o— axis  end  OF  wnich  is  the  stress 
path  for  slow  simple  wave  from  the  origin,  we  have  the  solution  wnich  consists 
oi  two  simple  waves  as  shown  in  Fig.  4.  If  (cr*,ra)  is  outside  of  region  I, 
there  is  no  solution  wnich  consists  of  simple  waves  only.  Jn  this  case  we 
consider  a  snock  wave  Vj  with  (<r+,t+)  =  (0,0).  The  stress  path  for  shock 
wave  Vj  is  the  negative  v-axis  Oumj  and  tho  curve  uQ  wnich  is  one  branch  of 
a  hyperbola.  The  other  oranch  of  the  hyperbola  violates  the  stability 
condition  and  hence  is  not  admissible,  fr  is  the  stress  at  which 

V1(0,0  -  fr,0)  =  c3(u,0)  (25) 

We  use  double  solid  lines  (or  double  dashed  lines)  to  denote  the  stress  path 
for  admissible  shock  wave  Vj  (or  V3).  The  arrows  on  the  path  indicates  the 
direction  from  (ot+,t+)  to  ,x~) .  Thus  if  (cra,ta)  is  the  point  Aj  in 
region  II  bounded  by  Q&OP,  the  solution  consists  of  a  shock  wave  Vj  which 
carries  the  stress .  trom  point  0  to  M  followed  by  a  slow  simple  wave  c3 
which  carries  the  stress  to  ^ ,  If  ((ra,ta)  is  the  point  A3  in  region  III, 
Fig  3,  wnich  is  bounded  by  &m^  and  ?Q  the  solution  consists  of  shock  waves 
and  V3,  Fig.  4. 

Notice  that  there  are  actually  more  than  three  types  of  solutions  as 
shown  m  Fig.  4.  For  instance,  if  (ut,Ta)  is  on  the  positive  o—  axis  (or  on 
OP),  the  solution  consists  of  one  simple  wave  c^  (or  c3)  only.  Similarly, 
if  (ca,Ta)  is  on  the  negative  u-axis  or  on  «r€L  the  solution  consists  of  one 
shock  wave  V1  only. 
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6.  DISCONTINUOUS  DEPENDENCE  OP  SOLUTION  ON  BOUNDARY  CONDITIONS.  The 
stress  path  “A*.  for  shock  wave  V3  in  Fig.  3  intersects  the  a -axis  at  point 
A^.  If  (o*,t*)  is  at  point  Ag,  we  have  the  solution  in  which  a  shock 

wave  carries  the  stress  froai  point  0  to  M  followed  by  a  V3  shock  wave 
which  carries  the  stress  froai  M  to  A3.  On  the  other  hand,  if  (cr*,ta)  is  at 
point  A^,  we  have  the  solution  in  which  only  one  shock  wave  carries  the 
stress  from  point  0  to  Aj.  As  point  Aj  approaches  A^,  it  can  he  shown 
that 

V1(0;M)  =  V3<M;A4)  =  V^OjA*)  (26) 

The  constant  region  ■,  Fig.  4.  between  the  two  shocks  diminishes  to  aero  and 
the  two  shocks  coalesce  into  one.  Therefore,  in  the  limit  as  A$  approaches 

A^  the  two  solutions  are  identical.  However,  as  long  as  is  not  on  the 

v-axis.  we. have  a  constant  stress  region  n  in  which  the  shear  stress  t*  is 

finite:  although  the  constant  region  nay  he  snail  for  snail  tine  t. 

Thus  for  a  half-space  which  is  initially  stress  free,  an  inpact  at  the 
boundary  X  =  0  with  o*  <  £  and  ra  -  0  produce  one  shock  wave  which  is  a 
longitudinal  shock  because  no  shear  stress  is  generated.  If  r*  is  nonzero  but 
very  snail,  one  has  two  shock  waves  (neither  of  then  is  longitudinal)  with  a 
finite  shear  stress  between  the  two  shocks  although  the  region  between  the  two 
shock  waves  nay  be  snail  for  snail  tine.  In  experinents  this  inplles  that  a 
slight  misalignment  ox  the  longitudinal  inpact  at  X  =  0  can  produce  quite 
different  response  at  X  £  0. 
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ABSTRACT 

A  substantial  improvement  in  resolution  hat  been  achieved  for  die 
computation  cf  jump  discontinuities  in  gas  dynamics  using  die  method  of 
front  tracking.  The  essential  feature  at  this  method  is  that  a  lower 
dimensional  grid  is  fitted  to  and  follows  die  discontinuous  waves.  At  die 
intersection  points  cf  these  discontinuities*  two-tfimemfonsl  FJesann 
problems  occur.  In  this  paper  we  study  such  two-dhnensional  Riemann 
problems  from  both  numerical  and  theoretical  paints  cf  view.  Specifically 
included  is  a  numerical  solution  far  the  Mach  reflection,  a  general 
classification  scheme  for  two-dimensional  elementary  waves,  and  a 
discussion  of  problems  and  conjectures  in  this  area. 


1.  Introduction 

Many  phenomena  in  nature  are  modeled  by  nonlinear  hyperbolic  systems  of 
conservation  laws: 

«,  +  v/Ci.)-o.  (U) 

The  example  considered  here  is  the  system  cf  Euler  equations  for  a  compressible,  invisdd, 
polytropic  gas.  The  equation  (1.1)  represents  an  idealization,  ts  solutions  ate  die  limits, 
as  viscosity  parameters  teed  to  zero,  cf  the  solutions  cf  more  complete  equations  such  as 
the  Navier-Stokes  equations.  The  solutions  cf  interest  for  the  system  (1.1)  are  frequently 
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»ngle.  When  the  incident  shock  strikes  the  wedge  comer  a  reflected  shock  (R)  is  formed, 
which  extends  from  the  reflection  point  to  the  shock  tube  wall,  where  it  forms  a  bow 
wave  in  front  of  die  wedge,  as  shown  in  Fig.  2.1.  We  will  refer  to  die  region  enclosed  by 
the  reflected  shock  as  die  "bubble".  As  with  Rtemarm  problems  in  general,  die  solution 
is  self-similar,  ie  *(tX)  -  u(ot,<r£)  for  every  tx>0.  In  die  computation  presented  here, 
the  Mach  number  of  the  incident  shock  is  2.QS  and  the  angle  of  the  wedge  is  63.4°. 

The  numerical  calculation  was  initialized  just  after  the  reflected  shock  had  formed 
and  enclosed  only  a  small  region  about  one  quarter  of  a  mesh  interval  in  height  Data  at 
the  two  ends  of  the  reflected  shock  we*"  bed  using  a  shock  polar  analysis.  At  the 
wedge  comer,  frz  two  velocity  components  vanish.  The  remaining  solution  components 
at  the  coiner  and  the  full  solution  b  the  interior  are  determined  by  interpolation.  One 
arbitrary  parameter  is  used  b  the  initialization,  which  is  the  oblafcnest  of  the  bubble, 
defined  as  the  ratio  of  the  distance  of  the  reflection  point  from  the  corner  to  die  distance 
of  the  bow  shock  to  the  comer  (the  redo  of  the  lengths  of  die  segments  BC  and  AB  b 
Fig.  2.1).  The  initial  oblateness  was  taken  from  experimental  data,  but  it  can  be 
determined  approximately  by  a  preliminary  calculation  because  it  is  constrained  to  lie  b  a 
bounded  interval  by  theoretical  considerations.  In  fact  the  computational  results  are  quite 
insensitive  to  its  value.  The  initialization  algorithm  can  be  regarded  as  an  approximate 
solution  of  die  two-dimensional  Riemann  problem  b  a  special  case. 

In  Fig.  2.2  the  contours  of  constant  density  and  constant  entropy  that  were  obtained 
numerically  are  shown.  In  Fig.  23  the  density  distribution  along  die  wall  obtained  from 
the  calculation  is  superimposed  on  the  experimental  data. 

3.  Mach  Reflection  of  Shock  Waves 

The  intersection  point  of  (Escontinrtty  curves  will  be  called  a  node.  In  a 
neighborhood  of  a  node  the  curves  are  approximated  by  straight  lines  separating  wedge 
shaped  regions.  In  analogy  with  die  ooe-dhnensianal  Riemann  problem,  we  define  a 
two-dimensional  Riemann  problem  to  be  an  initial  value  problem  for  a  two-dimensional 
conservation  law  having  data  that  *s  either  a  constant  state  or  a  simple  rarefaction  wave  b 
each  of  £  finite  number  at  wedges.  Such  problems  have  been  studied  for  scalar 
conservation  laws  [8,5,6],  but  only  special  solutions  are  known  for  systems  of 
conservation  laws.  As  with  the  solution  at  a  one-dimensional  Riemann  problem,  die 
solution  of  a  two-dimensional  Riemann  problem  will  evolve  into  a  more  complicated 
configuration  containing  several  elementary  waves.  Thus  b  front  tracking  we  must  solve 
a  subcase  of  tbs  full  Riemum  problem:  determining  the  velocity  and  states  associated 
with  die  one  specific  elementary  wave  (node)  being  tracked.  We  shall  report  b  this 
secdon  on  a  method  for  tracking  the  Mach  node.  Its  propagation  is  a  fully  two- 
dimensional  problem  that  cannot  be  be  reduced  to  one-dimensional  problems  by  spatial 
operator  splitting,  fa  feet  successive  solutions  to  one-dhnensiooal  problems  still  play  a  key 
role  b  solution  of  die  Mack  node,  but  their  composition  is  governed  by  the  geometry  of 
the  waves  altering  die  node  and  not  fay  an  orthogonal  set  of  coordinate  axes. 


problem  for  the  Euler  equations.  From  a  mathematical  point  of  view  there  is  no  misting 
equation,  since  the  solution  from  the  previous  time  step  provides  complete  Cauchy  data. 

The  front  tracking  problem  is  to  obtain  a  complete  Riemann  problem  solution  for 
given  Cauchy  data  and  to  select  the  Mach  node  out  of  that  solution.  As  formulated  this 
problem  is  too  difficult.  Hence  we  proceed  with  equations  (3.1)-(3.4).  With  sixteen  state 
variables  and  eleven  equations  at  the  node,  we  see  that  die  Mach  node  lies  in  a  five- 
dinunsional  manifold  within  the  space  of  states  x0,  *«,  u2,  and  uy  In  lieu  of  solving  a 
full  two-dimensional  Riemann  problem  to  select  a  paint  in  this  manifold,  five  of  the  above 
sixteen  parameters  are  selected  to  specify  the  node.  Using  physical  intuition,  we  selected 
five  parameters  from  the  complete  set  of  Cauchy  data  [4].  Baaed  on  numerical  evidence, 
v.'s  believe  this  method  does  a  satisfactory  job  of  picking  oat  die  Mach  node  from  die 
waves  emanating  from  the  complete  solution  of  a  two-dimensional  Riemann  problem  that 
is  close  to  a  Mach  node. 

We  compared  the  numerical  solution  for  the  nonsteadjy  Mach  reflection  with 
experiments  in  [3].  In  the  experiment  an  incident  abode  with  a  Mach  number  2.3? 
impinges  on  a  wedge  with  angle  27°.  The  calculations  were  initialled  just  after  the  Mach 
configuration  has  appeared.  The  reflected  shock  and  tire  Mach  stem  then  enclose  a  region 
of  about  one  mesh  interval  in  height  After  the  bubble  enckaed  a  region  several  mesh 
intervals  in  height  the  solution  had  settled  down  to  its  self-similar  form  as  seen  in  the 
experiment.  In  Fig.  33  die  constant  density  contours  are  shown.  In  Fig.  3.4  foe  wall 
density  distribution  obtained  in  our  calculation  h  superimposed  on  the  experiments!  data. 
However  the  calculation  is  preliminary  in  two  respects.  The  present  farm  of  the  algorithm 
for  the  propagation  of  the  Mach  triple  point  seems  to  be  stable  only  when  the  initial 
oblateness  is  chosen  near  the  experimentally  determined  value.  Moreover  the  algorithm 
for  the  propagation  of  the  point  of  intersection  of  the  contact  discontinuity  with  the  wall  is 
diacemably  unstable:  the  fluid  velocity  at  this  point  is  very  sensitive  to  die  pressure 
upstream,  so  the  end  of  the  contact  tends  to  curl  up.  The  causes  of  these  -notabilities  have 
not  yet  been  determined. 

4.  The  Classification  of  Two  Dfaaemfoaal  Elementary  Waves 

In  this  section  we  classify  the  elementary  waves  for  two-dbaensiooal  gas  dynamics. 
Front  tracking  employs  a  normal  and  a  tangential  operator  splitting  at  jump  surfaces  and 
a  solution  of  two-dimensional  Riemann  problems  at  tile  paint  singularities  formed  fay  the 
intersection  of  jump  surfaces  (“nodes”).  An  example  of  such  a  two-dimensional  Riemann 
problem  was  described  in  the  previous  section.  For  two-dbneasionai  compressible  gas 
dynamics  there  are  oily  a  small  number  of  such  nodes. 

We  make  some  general  assumptions  that  idealiwe  die  problem,  but  which  we  believe 
apply  to  a  generic  set  of  possible  point  singularities  formed  by  tile  meeting  of  jump 
surfaces  and  asnteied  rarefaction  waves.  Then  we  refine  there  general  raromptions  into  a 
precise  mathematical  formulation,  and  using  the  latter,  derive  a  classification  scheme  for 
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the  allowed  point  Angularities. 

Excluded  from  this  clarification  scheme  are  paint  singularities  formed  by  centered 
waves  (implosions)  and  points  in  a  neighborly  J  of  which  the  solution  is  not  piecewise 
smooth. 

Definition  4.1  A  pressure  wove  is  a  shock  wave  or  a  centered  rarefaction  wave.  A 
wave  is  either  a  pressure  wave  or  a  contact  discontinuity.  A  node  is  the  point  singularity 
formed  by  the  intersection  of  waves.  A  rarefaction  wave  centered  at  a  node  is  called  sn 
incoming  (forward  facing)  rarefaction  wave  if  its  straight  line  C+  or  C"  characteristic*,  in 
the  i'rame  in  which  the  node  is  stationary,  point  towards  the  node.  A  shock  wave 
emanating  from  a  node  is  said  to  be  an  incoming  shock  wave  if,  in  the  rtationary  frame,  it 
turns  the  flow  towards  the  node.  Similarly  we  define  an  outgoing  rarefaction  wove  and  an 
outgoing  shock  wove.  We  observe  that  every  pressure  wave  at  a  node  is  either  incoming 
or  outgoing. 

Aanmptloa  4  J.  We  assume  our  solution  a  to  be  an  elementary  wave,  which,  in 
general  terms,  satisfies  the  following: 

4.2.1  a  is  a  stationary  solution  of  the  Euler  equations  for  a  polytropic  gas:  a,  *  0 

and  V*/(a)  0; 

4.2.2  a  has  the  form 

a  *  uj  for  hj-i  <•<•/,  j  m  1 , . . . ,  * 
where  6„  =  0O  +  2ir  and  each  uj  is  constant  or  a  centered  rarefaction  wave; 

4.2  J  the  only  jumps  allowed  in  a  are  shock  waves  and  contact  discontinuities; 

4.2.4  a  is  generic; 

4.2.5  a  is  an  entropy  increasing  solution,  with 

a  *  Jim  a  , 

v-0 

where  a  is  a  solution  of  the  Navier-Stokcs  equations  with  viscosity  v. 

First  consider  all  possible  elementary  waves  containing  only  contact  dbooctinuities. 
If  one  of  the  sectors  has  a  180°  opening,  there  can  be  a  nonzero  fisoontinulty  in  the 
!*j>$$Mtial  velocity  across  its  boundary.  AH  other  contact  discontinuities  contain  only 
temp£3»ture  jumps.  It  is  possible  for  any  number  of  them  to  occur,  and  at  any  set  of 

ang&Ss. 

From  new  on  assuror  that  the  elementary  wave  contains  at  least  our  '  xssure  wave. 
Assumptions  4.2.4  and  4.2J  are  not  written  in  mathematical  terns,  so  wr  mulate  the 
ideas  that  they  express  in  a  manner  that  we  can  use  in  our  analysis. 

4.2.4a  No  incoming  rarefaction  waves  are  allowed.  There  cart  be  at  mew  two 
incoming  pressure  waves,  which  are  necessarily  shock  waves.  If  the  flow  on 
the  ahead  Ade  (the  Ade  with  the  lower  pressure)  of  an  incoming  abode  wave  is 
adjacent  to  a  contact  fisoontinuity  that  is  within  90°  of  the  booming  abode. 
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only  this  one  incoming  shock  wave  is  Allowed. 

4.2.5*  No  sectors  of  zero  velocity  bounded  by  contact  discontinuities  (“embedded 
wedged)  are  allowed. 

Observe  that  the  elementary  waves  that  satisfy  4.3.1  trough  4.2.4  but  fail  to  satisfy 
4.2  Ja.  can  be  interpreted  as  solutions  of  an  initial/bounduy-value  problem  if  the  contact 
discontinuities  bounding  the  embedded  wedge  are  replaced  with  reflecting  walls. 

In  [4]  we  have  determined  that  an  elementary  wave  containing  a  pressure  wave  is 
restricted  by  the  following.  There  is  a  unique  streamline  through  the  no&.  There  is 
either  one  or  two  incoming  shock  waves.  There  is  no  more  than  one  outgoing  pressure 
wave  on  each  side  of  the  athjunline  leaving  the  node,  only  one  of  which  may  be  a 
rarefaction.  The  streamline  is  a  contact  discontinuity  if  there  are  two  outgoing  pressure 
waves.  This  then  leaves  ue  with  only  a  small  number  of  possible  elementary  waves,  which 
we  shall  determine  presently. 

Theorem  4.3.  Under  Assumptions  4.2  an  elementary  wave  containing  at  feast  one 
pressure  wave  is  one  of  the  following  types,  as  specified  in  detail  below:  crass,  overtake, 
Mach,  diffraction,  and  transmission. 

What  follows  is  an  explanation  of  die  types  together  with  figures.  For  a  proof  of  die 
theorem  see  [4]. 

Diffraction.  T^e  diffraction  of  a  shock  impinging  on  a  contact  discontinuity,  causing 
a  reflected  and  a  transmitted  shock  is  a  possible  solution,  ace  tig  4.1.*.  We  show  in  Fig. 

4.1.  b  bow  the  solution  may  be  constructed  by  drawing  die  appropriate  shock  polar*  in  the 
0  ,p  plane. 

Diffraction,  The  configuration  in  Fig.  4,2.a  is  an  elementary  wave,  ft  is  as  in  Fig. 

4.1.  a,  but  with  a  rare&ctios  wavs  in  place  of  a  reflected  shock.  The  solution  is 
constructed  using  shock  polar*,  as  indicated  in  Fig.  4.2.b. 

Transmission.  A  shock  Impinging  on  a  contact  discontinuity  and  causing  a 
transmitted  shock  but  no  reflected  wave  (ice  Fig.  43.a)  is  a  possible  elementary  wave;  see 
the  corresponding  shock  polar  in  Fig.  43.b. 

Meet  node.  A  direct  Mach  reflection,  where  the  incident  shock  breaks  into 
shocks,  the  reflected  shock  and  the  Midi  stem,  is  a  possible  elementary  wave;  see 
4.4.a.  The  solution  is  found  using  shock  pdara  as  in  Fig  4.4.b. 

Overtake,  ft  is  possible  to  have  two  incoming  shocks  overtake  cash  other  and 
rise  to  two  outgoing  shocks  separated  by  a  contact  discontinuity  (see  Fig.  4J5.a). 
solution  may  be  constructed  using  shock  polar*  as  in  Fig.  43.b.  The  special  case  where 
one  of  the  two  incoming  shriek*  has  zero  strength  coincides  with  the  Mach  nods  case. 

OverMe.  One  shock  may  overtake  the  other,  rewiring  in  a  reflected  rarefaction 
and  a  transmitted  shock  (see  Fig  4.6.a).  The  solution  is  found  using  shod:  polar*  as  )r 
Fig.  4  >.b.  Note  that  for  the  same  parameter*  of  the  two  incident  shocks,  both  this  case 
and  die  prc^Tus  case  are  possible. 
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Cro«  or  Mach  node.  Two  incident  abodes  colliding  to  form  cwo  reflected  shocks 
separated  by  a  streamline  (see  Fig.  4.7.a)  is  a  possible  elementary  wave.  The  solution  is 
found  using  shock  paters  as  in  Fig.  4.7.b.  The  special  case  of  one  incident  shock  laving 
zero  strength  gives  die  direct  Mach  node.  A  single  shock  wave  in  tbs  outgoing  sector 
defines  the  inverted  Mach  node;  this  interaction  is  a  limit  of  the  cross  case  above,  where 
the  shock  between  region  2  and  3  in  Fig.  4.7.a  1  educes  to  zero. 

3.  Seme  Problems  and  Conjectures  Concerning  Riemann  Problems 

In  this  section  we  drop  the  restriction  to  two  dimensions  and  to  gas  dynamics,  but  we 
retain  the  terminology  of  Sec.  4.  Recall  that  in  steady  supersonic  two-dimensional  gas 
dynamics,  where  the  direction  of  the  flow  defines  a  timelike  direction,  the  equations  can 
be  reduced  to  the  form  of  a  one-dimensional  time-dependent  system  of  conservation  tews. 
Then  the  two-dimenskmcl  elementary  waves  viewed  in  the  stationary  frame  are  Riemann 
problems  for  a  distinct  but  related  one-dimensional  system.  Similarly  Riemann  problem 
solutions  in  n-1  space  dimensions  are  qualitatively  similar  to  elementary  waves  in  n 
dimensions. 

We  list  some  problems  of  general  interest  in  this  area. 

1.  The  possible  n -dimensional  elementary  waves  for  a  system  of  conservation  tews 
could  be  classified.  The  elementary  waves  in  two-dimensional  pofytropic  gas  dynamics 
were  classified  in  the  previous  section. 

2.  Let  the  incoming  wave  operator  be  the  solution  operator  bringing  two  or  more 
elementary  waves  to  a  single  point  and  thereby  defining  a  Riemann  problem.  The 
incoming  wave  operator  also  acts  on  single  elementary  waves  by  mapping  to  the 
configuration  at  a  time  of  bifurcation,  or  dynamic  instability;  this  nlso  defines  a  Riemann 
problem.  The  range  of  this  operator  is  limited  to  the  possible  mergers  or  bifurcations  of 
the  elementary  waves  found  in  the  classification  above.  Can  this  range  be  categorized? 

3.  The  outgoing  wave  operator  gives  the  possible  elementary  waves  that  may  occur 
in  the  solution  of  the  Riemann  problems  in  the  range  «f  the  incoming  wave  operator.  We 
pose  the  question  of  existence  of  solutions  for  this  restricted  set  of  data.  Are  solutions 
piecewise  smooth,  so  that  there  is  a  finite  number  of  outgoing  elementary  waves?  The 
answer  depends  on  die  order  of  the  system,  the  dimension  of  space,  and  the  convexity  or 
number  of  inflection  points  in  the  flux  function,  as  examples  [6]  show  and  analngtea  [7] 
suggest. 

4.  A  logical  scattering  matrix  -  is  a  map  from  sets  of  incoming  wave  types  to  sets  of 
outgoing  wave  types  as  labeled  by  the  solutions  to  problem  1.  It  decides  which  types  of 
incoming  waves  produce  which  types  of  outgoing  waves.  In  the  language  of  quantum 
mechanics,  the  problem  here  is  to  classify  the  possible  S  matrix  graphs.  Let  us  consider 
this  problem  from  the  point  of  view  of  two-dimensional  gas  dynamics.  We  restrict 
attention  to  Riemann  data  contained  in  the  range  of  the  incoming  wave  operator  a* 
defined  in  2  above.  Under  such  restriction,  the  waves  will  be  said  to  be  in  inmrdng 
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order.  The  allowed  nodes  of  See.  4  provide  interchange  of  wave  order  to  an  outgoing 
order.  In  general,  however,  the  interchange  of  wave  order  produces  three  outgoing  waves 
from  two  incoming  waves  and  need  not  reduce  the  total  cumber  of  wave  pahs  that  fail  to 
be  in  outgoing  order.  On  this  basis,  we  expect  that  even  simple  incoming  roafiguratkna 
could  produce  complicated  outgoing  wavs  interactions.  It  is  powible  that  the  complication 
(e.g.  the  number  of  nodes),  while  no'  bounded  a  priori,  is  still  finite.  In  fact  the  wave 
interactions  typically  decrease  the  Mach  number  of  the  flow,  and  may  give  rise  to  a 
subsonic  region,  inside  of  which  no  pressure  waves  ^an  occur.  Related  to  this  possibility 
is  the  occurrence  of  nodes  with  only  two  outgoing  waves.  Such  nodes  allow  the 
interchange  of  wave  order  with  a  reduction  in  the  total  number  of  pain  that  are  out  of 
order. 

5.  Uniqueness  is  an  open  problem.  Well  known  problems  of  noouniqueae*  are  not 
understood  on  a  fundamental  level.  For  example,  consider  an  incident  shock  hitting  a 
wedge,  resulting  in  either  a  regular  reflection  or  iviaeh  reflection.  There  are  region 
where  both  solutions  are  possible.  By  introducing  additional  physical  effects  such  as 
viscosity,  with  a  resulting  boundary  layer,  or  surface  roughness  on  a  certain  length  scale, 
this  overlapping  region  of  nommiquenes*  might  disappear. 

6.  Extended  or  nonlocal  Riemann  problems  may  be  considered,  where  die 
restriction  of  constancy  in  sectors  between  the  waves  is  replaced  by  linear  or  higher  order 
data.  This  has  been  implemented  few  one  dimension  in  the  normal  propagation  of  die 
front  [1]. 

7.  Lower  order  terms  in  the  equations  and  new  waves  in  the  Riemann  solution  may 
be  caused  by  geometrical  effects  and  by  external  sou  rets. 

8.  The  geometry  in  the  large  defined  an  the  state  space  by  die  flux  function  needs 
to  be,  understood.  For  gas  dynamics  the  qualitative  behavior  of  solutions  may  be  studied 
by  considering  the  acoustic  waves  of  the  linearized  problem.  But  this  is  not  die  case  for 
all  hyperbolic  conservation  laws,  atid  new  families  cl  waves  may  be  passible  in  the  large. 
The  topology  defined  by  the  critical  paints  of  the  flux  function  /  in  (1.1)  is  important 
here.  A  critical  point  is  a  point  in  state  space  where  the  gradient  A  *»  V  /  has  oabsddbg 
eigenvalues.  At  tile  critical  points  equation  (1.1)  is  no  longer  strictly  hyperbolic; 
moreover  A  can  fail  to  have  a  complete  set  of  eigenvectors.  Such  a  low  cf  strict 
byperbolidty  is  not  pathological  in  applications,  and  die  mathematical  consequences  of 
this  fact  have  not  been  developed.  An  extension  of  this  phenomena  are  the  problems  for 
which  the  equations  in  tSf&rent  regions  of  state  space  change  type.  The  applications  are 
of  general  interest:  ail  reservoir  simulation,  transonic  Cow  in  gas  dynasties,  chemically 
reacting  flows,  and  noGline&r  elasticity. 
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FI*.  2.1.  Regular  reflection  of  a  shock  wave  by  a  wedge.  A  vertical  shock  I  has  struck  a 
63°  wedge  from  the  left,  causing  a  reflected  shock  R,  which  forms  a  bowsbock  in  front  of 
the  wedge. 


Fig.  2.2.  The  numerical  simulation  of  a  regular  reflection,  where  the  incident  shock  has 
Mach  number  2.0$  and  the  wedge  angle  is  63.4°.  The  calculation  was  performed  on  a  80 
by  20  grid.  The  top  picture  shows  the  lines  of  constant  density  inside  the  bubble  formed 
by  the  reflected  shock.  The  bottom  picture  chows  the  lines  of  constant  entropy.  They 
should  coincide  with  the  integral  curves  for  the  self-similar  velocity  field.  Theoretical 

arguments  in  the  text  suggest  that  these  integral  curve*  ail  terminate  at  the  comer. 
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Fig.  3.2.  Hie  Riemann  problem  corresponding  to  the  tripie  point  obtained  in  a  Mtch 
reflection.  We  assume  the  shocks  and  contact  discontinuities  are  straight  line*  (labeled  as 

in  Fig.  3.1)  with  constant  states  0  through  3  in  the  wedges. 


distance  along  u-a U 


Fif-  3.4.  Hie  density  distribution  along  the  wall  of  the  Mach  reflection  ran  (solid  line) 
shown  superimposed  on  the  data  found  experimentally  by  Desch&mbault  and  (dots). 
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Flj.  *•*•*•  Infraction.  A  shock  /  ora  diffract  through  a  contact  cfiicmtinuity  C  to 
cause  a  reflected  shock  if  and  a  transmitted  ^  7. 


F%.  4.1.b.  The  shock  poiare  correspoctfing  to  Fig.  4.1  ji.  The  Mach  number  of  state  0  is 
2.7  and  that  of  state  0*  is  3.  The  shock  strength  of /is  3. 


Fig.  4.3.*.  Transmission.  A  shock  /  incident  on  *  co 
transmitted  shock  T  but  no  reflected  wave  is  possible. 


Fig.  4.3.b.  The  shock  polars  correspandiag  to  Fig.  4J. 
of  state  0  is  3  and  that  of  state  if  is  2.7. 
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MIGRATION  OF  TKE  GAS  GLOBE  FROM  UNDERWATER  EXPLOSIONS: 
THE  EFFECTS  OF  DRAG  AND  RADIATIVE  ENERGY  LOSS 


K.C.  Heaton 

Weapon  Systems  Section,  Armaments  Division 
Defence  Research  Establishment  Valcartier 
P.0.  Box  8800,  Courcelette 
Quebec,  GOA  IRQ 


ABSTRACT .  In  this  work,  the  problem  of  the  motion  of  che  bubble  from 
an  underwater  explosion  is  considered.  Previous  derivations  of  the 
equations  of  motion  are  summarized.  It  is  shown  how  a  Lagrangian 
method  can  be  utilised  to  obtain  the  equations  of  motion,  and  how 
generalised  dissipative  forces  can  be  formally  incorporated. 

The  generalised  dissipative  force  for  energy  loss  by  radiation 
of  sound  is  obtained  from  first  principles,  and  found  to  be 
approximately  proportional  to  the  cube  of  the  rate  of  change  of  the 
bubble  radius  with  respect  tc  time. 

The  algorithm  by  which  the  equations  of  motion  were  solved 
numerically  is  briefly  discussed,  and  some  results  of  the  computations 
presented. 

It  is  found  that  the  approximate  formula  for  energy  loss  by 
radiation  of  sound  by  a  bubble  in  the  absence  of  vertical  motion 
(derived  by  Herring)  gives  results  similar  to  those  obtained  using  the 
mo  ~  exact  formulation  developed  in  this  work.  It  is  also  found  that 
ti  'te  of  energy  loss  is  affected  slightly  by  processes  analogous  to 
ra^.  .cion  reaction  in  electromagnetic  theory. 

Results  are  also  presented  for  the  cases  for  which  energy  is 
lost  by  drag  as  well  as  by  radiation  of  sound,  and  for  the  case  for 
which  the  effects  of  free  and  rigid  surfaces  on  the  bubble  motion,  but 
not  energy  loss,  are  considered. 
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I.  INTRODUCTION.  One  of  the  more  prominent  phenomena  associated  with 
underwater  detonations,  after  the  initial  shock  produced  by  the  explo¬ 
sion  itself,  is  the  formation  of  a  bubble  of  gaseous  explosion  prod¬ 
ucts.  This  bubble  rises  toward  the  surface  of  the  water,  responding  as 
it  does  so  to  changes  in  the  external  pressure  distribution  with  radial 
oscillatory  motion  during  the  course  of  which  it  loses  some  of  its 
energy  through  the  emission  of  sonic  pulses.  Although  the  bubble  may 
have  been  initially  spherical,  the  effect  of  Its  upward  motion  distorts 
it  into  a  non-spherical  shape,  which  becomes  most  pronounced  in  the 
neighbourhood  of  t.te  minimum  bubble  radius.  The  alteration  in  the 
bubble’s  shape  further  affects  both  the  pulsations  and  the  upward 
translational  motion  of  the  bubble.  By  means  of  finite  element  tech¬ 
niques,  the  equations  of  motion  of  the  bubble  can  be  solved,  taking 
into  account  the  effects  of  the  changing  shape  of  the  bubble,  although 
the  amount  of  computing  time  required  by  this  method  limits  its 
utility.  Finite  element  methods  have  a  further  disadvantage  in  that 
physical  insights  into  the  systems  considered  are  rather  more  difficult 
to  come  by  than  might  otherwise  have  been  the  case. 

Herring  (1942")  and  others  (eg.  Taylor  (1942)  and  Shiftman  and 
Friedman  (1944))  have  treated  the  problem  of  the  motion  of  the  bubble 
by  considering  it  to  be  a  perfect  sphere  throughout  its  entire  motion. 
This  treatment  yields  values  for  the  periods  of  radial  pulsations  of 
the  bubble  which  are  in  good  agreement  with  experimental  data,  but 
predicts  a  much  more  rapid  movement  toward  the  surface  than  that  which 
is  actually  observed.  This  arises  because  the  largest  upward 
velocities  of  the  bubble  occur  at  those  times  when  the  bubble  is  near 
its  minimum  radius;  it  is  precisely  then  that  the  largest  departures 
from  sphericity  have  been  observed.  Hicks  (1972)  added  a  drag  term  to 
the  equations  of  motion,  chosing  the  drag  coefficient  such  that  the 
predicted  distance  travelled  by  the  bubble  was  in  agreement  with  that 
which  had  been  observed. 


Iu  this  paper,  previous  formulations  of  the  problem  of  the  motion 
of  an  underwater  bubble  are  discussed.  The  ways  in  which  the  effects 
of  drag,  as  well  as  those  of  the  energy  radiated  by  a  bubble  during  its 
motion,  can  be  incorporated  into  its  equations  of  motion  are 
considered.  A  more  rigorous  derivation  of  the  dissipation  function  for 
the  bubble  is  presented.  The  aquations  of  motion  incorporating  the 
effects  of  gravity  and  loss  of  energy  by  drag  and  the  radiation  of 
sound  by  the  bubble,  are  derived  using  a  Lagrangian  formalism.  The 
algorithm  by  which  various  forms  of  the  equations  of  motion  were  solved 
Is  briefly  discussed.  Some  computational  results  are  presented,  along 
with  suggestions  for  the  further  extension  of  this  work. 
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I I .  EQUATIONS  OF  MOTION  FOR  A  SPHERICAL  BUBBLE. 

II. 1  Review  of  Previous  Work.  Taylor  (1942)  derived  equations  des¬ 
cribing  the  motion  of  a  spherical  bubble  of  gas  undergoing  both  radial 
pulsations  and  translational  motion  toward  the  water  surface.  These 
are: 

W(^)2  +  p3”2  +  ^-pa3gz  -  V  -  E(a),  [1J 


Is; 

3  J 
a  o 


[2] 


where  a  is  the  radius  of  the  bubble  as  a  function  of  the  time  t,  U  its 
upward  velocity,  z  the  position  of  the  bubble  below  the  pressure  datum 
(i.s.  below  the  zero  pressure  level),  E(a)  the  internal  energy  of  the 
gas  comprising  the  bubble,  p  the  density  of  the  water,  g  the 
gravitational  acceleration,  and  Y  the  total  energy  of  the  bubble.  In 
Taylor's  formulation,  there  was  no  mechanism  included  for  energy  loss, 
and  hence  Y  was  taken  to  be  a  constant*  For  TNT  explosions,  it  has 
been  found  (Herring  1942)  that  approximately  50Z  of  the  total  explosion 
energy  is  retained  by  the  bubble;  in  that  case: 


Y  =  (1.85  x  1010)M 


[3] 


where  Y  is  measured  in  ergs,  and  M,  the  original  mass  of  the  explosive 
charge,  is  given  in  grams.  If  one  assumes  that  the  gaseous  explosion 
products  obey  the  ideal  gas  law,  then  the  internal  pressure,  P,  Is 
given  by: 


P  =  k(Pg)Y  [4] 

where  p  is  the  density  of  the  explosion  products  and  y  the  ratio  of 
specific  heats.  Assuming  that  the  entire  mass,  M,  of  the  explosive  has 
been  converted  to  gasn 


M 


3 

and  hence: 

CO 

E(a)  =  /  PdV 
a 


[5] 


[6] 
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Y  -3(Y-1) 


kM  a 


2 

where  dV  ■  4* a  da. 


Taylor,  using  the  work  of  Jones,  set: 


k  -  7.83  x  109  , 

Y  -  1.25 


[7] 


for  TNT  where,  in  eq. 


[6],  E(a)  is  measured  in  ergs  and  p  in  go/cm3. 

8 


Herring  (1942)  derived  an  equation  of  motion  for  the  bubble, 
neglecting  the  effects  of  gravity  and  transl  tional  motion  U,  but  in¬ 
corporating  the  effects  of  the  loss  of  energy  by  the  radiation  of 
sound.  This  is: 


2  2  2  2 
da.  3  r  da%  1  d  rj  /  da-\  i  a  dPa<  1  —  1  da^ 

3  ~2  +  2  1  dP  “  d?-  “  pi  dt"^  cdF 

at 


[8] 


where  Pa  is  the  gas  pressure  in  the  buble  when  its  radius  is  a,  c  is 
the  speed  of  sound  in  the  water,  P  the  hydrostatic  pressure,  and  all 
other  variables  are  as  defined  previously.  Herring  has  shown  that  the 

energy  loss,  which  is  contained  in  the  term  - — ~~  f1  —  4r)»  is 

pc  dt  v  c  dt' 

significant  only  during  the  times  at  which  the  bubble  is  near  its 
minimum  radius.  Using  the  values  of  k  and  Y  from  eq.  [7]  for  TNT.,  cne 
finds  that  the  loss  of  energy,  AY,  over  one  cycle  of  radial  pulsation 
of  the  bubble,  is  approximately: 


P  k 
AY  .  R7  rati 
~  1,87 


(?) 


where  P  .  is  the  gas  pressure  at  the  minimum  radius  .he  bubble 
ran 

during  that  cycle. 

Finally,  the  presence  of  the  water  surface  and  the  ocean  bottom 
has  an  effect  on  the  bubble  motion.  Using  the  theory  of  images, 
Shlffman  and  Friedman  (1944)  have  modified  Taylor's  equations  of  uotion 
for  the  bubble,  thusly: 


2rrp aJ(l  +  fQ)(^)  -  ^npa^U  ~  +  jpa3(l  +  3f2)U 


4ir  3 

+  —  pa  gz  -  Y  -  E(a)  , 

^  *  «2>»  -  if]  ‘  -  ±2  hr  (H)2 

4D 


1  df2  ..2 


,  j.  z  t,z  -  1  dai  ,2  3 
+  7aTU  ~  2  hTT  U  hTJ  +T?a 


where  fQ,  f^,  f 9  are  infinite  series,  b  is  the  height  of  the  centre  of 
the  bubble  above  the  ocean  floor,  and  a  is  given  by: 


(i.e.  the  ratio  of  the  bubble  radius  to  twice  the  distance  from  tLa 
bubble  centre  to  the  ocean  floor).  The  leading  terms  of  the  series  fQ, 
f,,  f£  are: 


r  ^  1  * 

fo“a  +  2°  ’ 


,  1  2  1  5 

fl  •  2a  +  2a  > 


.  1  3  .  1  6 

f2  "  2  °  +  2  °  * 


The  preceeding  suggests  an  approximate  method  for  determining  the 
amount  of  energy  lost  through  radiation  of  sound  by  a  spherical  bubble 
undergoing  both  translational  motion  and  radial  pulsation.  In  this 
procedure,  one  ignores  the  loss  of  energy  until  just  after  the  bubble 
has  passed  its  minimum  radius,  then  subtracts  the  energy  radiated  away, 
using  eq.  [9],  and  proceeds  with  the  integration  until  the  next  minimum 
is  reached.  In  other  words,  one  Integrates  Taylor's  eqs.  [1-2),  keep¬ 
ing  Y  constant,  with  its  value  being  given  by  eq.  [3J.  After  the  bub¬ 
ble  has  gone  through  -its  minimum  radius,  one  replaces  Y  in  eq.  [1]  by 
Y',  where: 
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[14] 
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r  =  y  ( 1  -  ^|) 

and  AY  is  given  by  eq.  [9].  One  then  repeats  this  procedure  whenever 
the  bubble  radius  passes  through  a  local  minimum,  using  the  new  value 
of  Y,  Y\  in  aq.  [9]. 

Hicks  (1972)  incorporated  a  drag  force,  Fp,  into  the  equations  of 
motion,  where  Fp  is  given  by: 

F„  -  |cD,pa2U2  [15! 


and  the  value  of  the  drag  coefficient,  Cp,  was  chosen  to  be  "  2.25 

in  order  to  bring  the  distance  travelled  upward  by  the  bubble  at  its 
first  maximum  into  agreement  with  that  actually  observed  for  500  lbs  of 
TNT  detonated  150  ft  below  the  surface. 

By  differentiating  [2],  he  obtained  the  rate  of  change  of  momentum 
with  respect  to  time,  and  equated  that  to  Fp.  The  momentum  equation 

then  would  then  become: 


,  3  3  „  2  2 

2a  g  -  4  cDa  U 


The  energy  dissipated  by  drag  forces  is  given  by: 


[16] 


dE  1  2  3 

d?  =  2Vpa  ° 


[171 


The  rate  of  energy  loss,  -dE/dt,  wes  then  equated  to  the  time  deriva¬ 
tive  of  [1]  to  obtain  the  energy  equation: 


pa3gz  +  E(a)) 


=  -  -|cDirpa2U3  .  [18] 

It  should  be  noted  that  eqs.  [16]  and  [18]  are  not  precisely  identical 
to  those  of  Hicks,  since  they  neglect  his  correction  for  the  effect  of 
the  water  surface. 
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II . 2  Equations  of  Motion  including  Energy  Loss  by  Drag  and  Radiation 
of  Sound.  The  method  described  in  the  previous  section  for  determining 
the  energy  loss  through  radiation  of  sound  by  a  bubble  has  a  number  of 
drawbacks,  not  the  least  of  which  is  that  the  subtraction  of  energy 
from  eq.  [1]  at  the  minimum  radius  can  adversely  affect  the  convergence 
of  numerical  solutions  to  the  system  of  differential  eqs.  [1-2] .  As 
well,  the  energy  loss  expression,  eq.  [9],  was  expressly  derived  under 
the  assumption  that  any  effects  of  gravity  and  translational  motion  on 
the  energy  loss  would  be  negligible;  one  really  has  no  a  priori  reason 
for  believing  this  to  be  the  case. 

For  these  reasons,  there  seems  to  be  a  need  for  a  treatment  which 
produces  a  more  rigorous  formulation  of  the  equations  of  motion  of  a 
bubble  in  the  presence  of  gravity,  incorporating  the  effects  of  the 
radiation  of  energy  by  sound,  and  which  can  be  easily  extended  to  in¬ 
clude  other  forms  of  energy  loss,  such  as  drag.  Such  a  treatment  is 
presented  in  this  section. 

The  kinetic  energy,  T,  of  the  water  surounding  a  sphere  of  radius 
a  moving  with  an  upward  translational  velocity  U,  is  given  by: 

T  -  2*pa3(^)2  +  jPa3U2  [19] 

(Taylor  1942,  Cole  1948).  The  potential  energy,  V,  is  given  by: 

V  -  E(a)  +  Vp  [20] 

where  E(a)  is,  as  before,  the  internal  energy  of  the  bubble,  eq.  [6], 
and  Vp,  the  energy  associated  with  the  hydrostatic  pressure  around  the 

bubble: 


and  the  Hamiltonian,  H: 


H  -  T  +  V 


-  3  /  da>  ,  if  3, 2.  ,  4ir  3 

2&p  a  (.~J  +  ^palr  +  pa  gz 


vhy  .T**3 

(T-l)  (?-)Y‘1 


[23] 


The  equations  of  motion  of  the  bubble  are  therefore  giver,  by: 


-  §7  -  V 

St^  -  &  ■  ^ 


(24) 


•  ds.  •  dz  ' 

where  a  *  -rr,  z  m  ~rr  m  -U,  and  the  Q.  are  the  non-conservative 
at  at  x 

generalized  forces. 

Evaluating  [24],  one  obtains,  as  equations  of  motion  for  a 
bubble : 


4*  pa 


3  d^a 
dt2 


+  6irpa2(~)  -  irpa^U 


2„2 


+  4*pa  gz 

Q. 


-  *MY  m-3r+2  . 

3 


(^)Y 


d  ,  3m  _  3  "z 

“<aD>  2,e  '  (£) 


[25] 


[26] 


As  might  have  been  expected,  when  Qfl  ■  *  0,  i.e.  when  there  is 
no  change  in  the  total  energy  of  the  system,  eqs.  [25]  and  [26]  can  be 
integrated,  and  yield  Taylor’s  eqs.  [1]  and  [2]. 

In  order  to  complete  the  derivation  of  the  equations  of  motion  of 
the  bubble,  it  is  necessary  to  determine  Qa  and  Qz. 


Now,  the  velocity  potential,  $,  is  given  by: 


$  "  $ i  +  $2 


[27] 


where 


2  , 
a  da 

r  dt 


11 1) 

r 


[23] 


.  1  a  „ 

$  2  ~  ~2  ~~2  U  COS 
r 


V*(A(t) 


cos  0 


n) 


[29] 


where  r  and  ©  are  the  radial  and  angular  co-ordinates,  respectively, 
shown  in  Fig.  1  and  n  is  the  unit  vector  normal  to  the  bubble  surface 
(e.g.  Cole  1948).  Since  the  bubble  is  explicitly  assumed  to  remain 
spherical,  n  =  ?,  where  r  is  the  unit  radial  vector. 

Following  a  development  similar  to  that  of  Landau  and  Lifshitz 
(1966),  in  the  wave  zone: 


[30] 


where 


t' 


t  - 


[31] 


and  c  is,  as  before,  the  velocity  of  sound  in  water.  The  velocity,  V, 
of  the  water  in  the  wave  zone  luust  therefore  be  given  by: 

V  »  $  (J> 


f  1_  3f(t)  _  cos  e  32A(th  a  ^ 

lcr  3 1  2  .  2  J 

rc  3t 


[32] 


where  terms  of  higher  negative  order  in  r  have  been  ueglected. 


dE 


The  total  energy  emitted  as  sonic  radiation  per  unit  time,  — ,  is 


then: 
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[33] 


where  the  surface  integral  has  been  taken  over  a  sphere  of  radius  r. 


To  a  good  approxir-ition,  the  term  in  eq.  [33]  proportional  to  — ^  can  be 

c 


neglected  for  low  translational  velocities,  since  it  will  be  2  orders 
of  magnitude  smaller  than  that  proportional  to  1/c. 


Now, 


3f(t) 

at 


2  d^a  ,  ^  fda-v" 
a  +  2a  l-jrJ 


dt 


[34] 


and  hence: 


dE 

dt" 


4rrp 


4  2  2 

r ,  2fda,»  ,  ,  ?  'dai  d  a  , 

l  *a  t  jrJ  +  **  nrJ  —  + 


dt 


/  a2  2 

A4-*)  ) 

dt 


[35] 


Now,  in  view  of  the  foregoing,  it  would  seem  reasonable  to  conclude 
that,  in  the  absence  of  drag  forces,  the  dissipative  forces  depend 
solely  upon  the  rate  of  change  of  the  radius  of  the  bubble,  and  hence 
that: 


Q  =•  0 
z 


[36] 


In  that  event,  it  is  possible  to  write  that: 
aQ_  ! 


dE 

dt 


[37] 


and  hence: 


-  4rrp—  Q  -  a  — 

r  _  • 


3Q 

•  s 

da 


[38] 
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Qa  =  [l6(£)3+  8a(ff)4  +(Aa(f): 

dt 


+  2a2^|) 

dt  3  a  dt 


By  substituting  eqs.  [36]  -  [39]  into  eqs.  [25]  and  [26],  one  obtains: 


d  a  2  riii  _2 

’  2  '■dtJ  a 


dt 


+  g2  ~ 


kMY  <r* 


c^Tp 


[40] 


a 


(  Airpa  ) 


dU  _  3U  da 

dt  L%  a  dt 


[Al] 


Differentiating  eq.  [AO]  with  respect  to  a,  one  obtains: 


3  ( d2a  i 

•  l  "2  ' 

3a  dt 


-  H  + 

a 


3Q„ 


Awpa  3a 


[A2] 


Substituting  eq.  [A2]  into  eqs*  [38]  and  [39]  one  finds: 


2  da-»3 


3  2 

a  fda-.d  a 


Q,--4<[4(^)  ]  -  fcpfCfJi}  -  [4a(f) 

dt 


dtJ 


2  d2ai  1  3Q- 


+  2a*  i-  -  (£)  -2. 
dt2  ac  3a  ^  “ 


[A3] 


3a 


ilie  evaluation  cf  the  last  three  terms  in  eq.  [A3]  still  presents  some 
difficulty.  However,  by  equating  the  rate  of  change  of  the  Hamiltonian 


5A5 


m 


H  of  the  bubble,  given  by  eq.  [23],  with  respect  to  time,  to  -dE/dt, 
tlmaly: 


dK 

dt 


dE 

dt 


[44] 


one  obtains  an  alternative  form  of  eq.  [40],  the  equation  of  motion.  A 
detailed  comparison  of  terms  shows  thac: 


.  _4.pl  K£)3  H  ♦  .2(^fW(£)) 
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By  comparing  eqs.  [45]  to  [43],  it  follows  that: 

2 


3  .  :i  .2  .2  ,  3Q 

-  „a  f  da'N  d  a  r.fdaN  *  d  a-i  1  a 

-8*p-  (dt)  77  -  -  Wdt'  +  2  ~2J  c 


dt 


dt 


3a 


.2  2 


[46] 


+  £e_  a4  ((i-ff/  (If.))  -  (If)  5? 


dt 


■dtJ 


3a 


The  terms  on  the  left  hand  side  of  eq.  [ 46 j  are  recognizable  as  being 
analogous  to  radiation  reaction  terms  in  electromagnetic  theory,  and 
hence  are  ignorable  in  a  first  approximation.  By  extension,  it  follows 

a3  da  d2a 

that  arp—  (-rr)  — =*  is  also  ignorable,  and  hence: 
c  at  dtz 


Qa  -  -  Wff)  ] 


[47] 


If  greater  accuracy  is  desired,  these  neglected  terms  can  be  evaluated, 
most  conveniently  by  means  of  [45],  and  added  to  the  generalised  force 
Sa¬ 
if,  following  Hicks  (1972),  one  introduces  a  drag  force,  Fp,  of 
the  form  of  eq.  [15],  one  can  show  that  in  [24]  ■S  given  by: 


_  2„2 

Qz  •  lVpa  U 


[48] 
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where  O'  is  given  by: 
a 


[51] 


or,  if  second  order  terms  are  included-  by: 


da-i  d  a 


dt 


In  this  section,  we  have  presented  2  sets  of  equations  of  motion 
which  had  been  previously  derived  by  various  authors  to  describe  the 
motion  of  a  spherical  bubble  under  different  conditions.  We  have 
extended  these  to  incorporate  more  rigorous  approximations  to  the 
effects  of  energy  loss  by  the  radiation  of  sound  as  well  an  estimate  of 
the  influence  of  drag  upon  the  motion. 


To  sum  up:  eqs.  [1-2]  are  the  equations  of  motion  for  a  spherical 
bubble  undergoing  radial  pulsations  and  translational  motion,  without 
energy  loss.  The  introduction  of  eq.  [9]  into  this  system  at  the  time 
at  which  tha  bubble's  radius  has  reached  a  local  minimum  provides  a 
crude  mechanism  for  incorporating  energy  loss  by  sonic  radiation. 
Equations  [10-11]  apply  when  the  effects  of  surfaces  such  as  the  sea 
bed  and  surface  significantly  affect  the  motion  of  the  bubble,  and  do 
not  incorporate  energy  loss.  Finally,  eqs.  [49-52]  are  the  equations 
of  motion  of  a  spherical  bubble  undergoing  radial  pulsations  and 
translational  motion  in  an  infinite  medium  and  in  wuich  the  effects  of 
energy  loss  from  drag  and  radiation  of  sound  have  taken  account  of  in  a 
reasonably  rigourous  fashion. 


* .  v*  **  i 


ill .  MUMERT CAL  METHODS  OF  SOLUTION.  Before  one  attempts  numerical 
solutions  of  any  of  these  sets  of  equations,  it  is  useful  to  make  the 
equations  of  motion  non-dimensional.  Thus,  the  substitution  of: 

a  «  a*  L, 


z  "  z*  L, 
t  *»  t*  1-, 

U  -  U*  L/T, 


into  the  equations  of  motion  used,  where 


<5-> 


vr 


yields  a  dimensionless  form  of  the  equations  of  motion.  As  before,  Y 
is  given  by  eq.  [3]  and  g  and  p  are,  respectively,  the  gravitational 
acceleration  and  the  der  ity  of  water.  These  particular  scaling 
factors  in  eq.  [2]  were  originally  used  by  Taylor  (1942). 

Since  all  of  the  equations  of  motion  have  the  unfortunate  property 
of  singularity  at  the  origin,  it  is  necessary  to  begin  the  integration 
with  a  series  solution.  Taylor  (1942)  suggested  initial  values  for  the 
dimensionless  variables  of: 

,*  2/5 

a*  -  ( _ _ ) 

'■l. 0025' 

O*  -  (±2)t*  ,  [551 

(Tf)  t'*)2 

for  values  of  t*  near  zero. 

A  similar  problem  arises  if  one  desires  to  use  the  more  accurate 
approximation  to  Q’,  eq.  [52],  in  the  equations  of  motion.  Since  da/dt 

becomes  0  at  several  points  during  the  motion  of  the  bubble,  the 
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.last  term  in  eq.  [53]  is  formally  undefined  at  those  points.  However, 
if  one  uses  the  expression  for  the  total  entity,  aq.  [1],  one  can 
write,  in  terms  of  the  dimensionless  variables: 


fda*i  r  ’ 

1  “  l* 


E*(a*)->1 

Y 


- 3 

2w  (a*; 


where 


Y(t)  is  the  total  energy  at  time  t,  and  all  other  variables  are  as 
previously  defined. 


If  one  defines: 


a  »  (Y(t)  -  E*(a*)j 


2x  (a*)‘ 


s  -  +  4  2* 


one  can  write: 


risif1  .  ±  i  (l  +  It  +  2.  (?.) 2  + 


where  the  positive  value  is  taken  while  the  bubble  is  expanding,  and 
the  negative  while  contracting.  Equation  [59]  may  be  used  to  evaluate 
the  expression  for  Q^,  eq.  [52],  if  one  wishes  to  Include  the  final  two 

terms.  When  such  was  done  In  this  work,  the  value  of  Y(t)  was 
approximated  at  each  step  of  the  integration  by  the  substitute  n  of  the 

d^a 

current  estimates  for  da/dt,  U  and  z  into  eq.  [1].  An  estimate  for  — sr, 

dt 

was  obtained  by  substituting  the  current  estimates  for  da/dt,  0  and  z 
into  eq.  [49]  with  set  to  0.  The  series,  eq.  [59],  can  be 

terminated  at  any  point,  and  the  truncation  error  determined  by 
reperforming  the  integration  with  the  inclusion  of  the  next  higher 
power  of  0/a.  In  this  work,  the  last  term  included  in  the  series  was 

(B/a)4. 
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The  actual  integrations  were  carried  out  using  a  4  point 
Runge-Kutta  algorithm  incorporating  automatic  error  controls. 

The  initial  parameters  which  must  be  supplied  to  the  programme  are 
listed  in  Fig.  2. 

IV.  NUMERICAL  RESULTS  AND  ANALYSIS.  Taylor  (1942)  solved  eqs.  [1]  and 
(2],  neglecting  the  internal  energy  of  the  gas  (i.e.  setting  E(a)  *  0). 
His  results  were  therefore  independent  of  the  mass  of  the  explosive 
charge,  and  hence  were  amenable  to  scaling,  a  convenience  which  was 
achieved  at  the  expense  of  some  accuracy,  especially  near  the  minimum 
radius.  Figures  3-5  show  a  comparison  of  the  radius,  velocity,  and 
height  above  the  original  explosion  for  a  charge  of  TNT  of  mass  2.1136 
kg.  detonated  at  a  depth  of  6.1  metres  below  the  surface,  calculated 
using  Taylor's  equations,  with  and  without  the  internal  energy  having 
been  neglected.  As  can  be  seen,  the  internal  energy  does  have  a 
significant  effect  on  the  behaviour  of  the  bubble.  Accordingly,  in  all 
subsequent  calculations,  it  should  be  understood  that  the  effect  of  the 
internal  energy  has  been  taken  into  account. 

Figures  6-8  compare  the  results  obtained  by  solving  Taylor's 
equation  of  motion  £1 )— [2 J ,  and  the  ones  derived  in  this  paper  eqs. 
[49]-[50]  with  the  non-conservative  generalized  forces,  Qa  and  Q2, 

having  been  set  to  zero  for  the  bubble  produced  by  an  explosive  charge 
of  2.1136  kg.  detonated  at  a  depth  of  34.81  metres  below  the  ocean 
surface.  Since  one  would  expect  the  integration  of  the  two  sets  of 
equations  to  yield  identical  results  under  these  conditions.  Figs.  6-8 
serve  as  a  test  of  both  the  theoretical  derivation  and  the  numerical 
algorithm.  As  can  be  seen,  the  results  of  the  two  sets  of  computations 
are  virtually  Identical.  It  should  be  noted  that  the  solution  of  the 
set  of  equations  of  motion  [49]~[50]  require  the  provision  of  an 
Initial  value  for  da/d t.  The  programme  permits  one  to  do  this  by  one 
of  two  ways:  either  by  differentiating  the  expree  on  for  a  in  eq., 

[55]  (SKIP  «  0),  or  by  substituting  the  values  for  a,  U,  and  z  obtained 
from  eq.  [55]  Into  eq.  [1]  and  solving  for  da/dt  at  the  initial  time 
(SKIP  *1).  This  latter  procedure  seems  to  be  the  more  accurate,  since 
it  amounts  to  specifying  the  initial  total  energy  to  be  equal  to  that 
given  by  e<>  [3].  Figures  9-11  show  the  dependence  of  the  calculated 

values  for  the  radius,  translational  velocity,  and  height  above  the 
explosion,  using  eqs.  [49]  and  [50],  (with  Qa  “  Qz  “  0)  on  the  raethed 

chosen  to  obtain  the  initial  value  of  da/dt.  Although  the  general 
behaviour  of  the  two  graphs  is  qualitatively  the  same,  there  is 
substantial  detailed  disagreement,  indicating  that  the  algorithm  is 
sensitive  to  the  initial  choice  for  da/dt. 

Figures  12-15  show  the  effects  of  energy  loss  by  the  radiation  of 

sound,  only,  (i.e.  Q  *  0)  on  the  motion  of  a  bubble  produced  by  an 

z 

explosive  charge  of  the  mass  and  at  the  depth.  The  energy  loss  was 
calculated  by  using  the  more  accurate  expression  for  Qa,  eq.  [52], 
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including  the  radiation  reaction  terms.  The  effect  of  omitting  these 
terms  (i.e.  using  the  value  for  Q^,  given  by  eq.  [51]  is  shown  in  Figs. 

16-19.  Evidently,  the  characteristics  of  the  motion  of  are 
•-ubstantially  affected  by  the  radiation  of  energy.  Specifically,  the 
maximum  radius  of  the  bubble  is  decreased  and  the  minimum  radius  is 
increased  in  comparison  with  those  which  one  would  obtain  by  neglecting 
energy  less.  The  radiation  of  energy  also  decreases  the  calculated 
periods  of  oscillation  slightly,  as  shown  in  Fig.  20. 

In  the  particular  example  illustrated  here,  whether  the  energy 
loss  is  calculated  using  eq.  [9]  or  the  formalism  developed  in  section 
II. 2  seems  to  have  little  affect  on  the  characteristics  of  the  motion 
for  this  case,  at  least  early  in  the  bubble  motion,  although  this 
difference  is  more  striking  when  the  approximate  radiation  reaction 
terms  are  included.  However,  as  shown  in  Figs.  15  and  19,  the 
distribution  of  the  energy  loss  over  time  depends  quite  strongly  on 
which  formula  for  energy  loss  is  used,  and  whether  the  radiation 
reaction  terms  are  included. 

Figures  21-24  show  the  effect  of  Including  drag  in  the  equations 
of  motion,  with  and  without  the  addition  of  radiative  loss,  for  the 
bubble  produced  by  227.27  kg.  of  TNT  at  a  depth  of  45.73  metres  below 
the  surface.  It  has  been  observed  that  a  bubble  from  such  an  explosion 
rises  approximately  3.35  metres  from  the  location  of  the  explosion  in 
the  time  taken  to  reach  its  first  maximum.  Hicks  found  that  where  drag 
is  the  only  source  of  dissipation,  a  drag  coefficient  of  CD  *  2.25  was 

necessary  to  reproduce  this  behaviour.  In  this  work,  it  was  found  that 
a  coefficient  of  Cp  *  1.85  best  matched  the  observed  rise  to  the  first 

minimum,  in  the  absence  of  radiation  of  sound.  When  radiative 
dissipation  was  considered,  a  drag  coefficient  of  Cp  *  1.6  seemed  best 

to  fit  the  observed  motion,  and  it  is  this  value  which  was  used  in 
Figs.  21-24. 

Finally,  Figs.  25-27  show  the  changes  in  the  motion  of  the  bubble 
resulting  from  the  inclusion  of  the  effects  of  the  sea  bed,  located  at 
a  depth  of  8.94  metres  below  the  explosion,  and  the  surface,  but  not 
those  of  energy  loss,  for  2.1136  kg.  of  TNT,  detonated  at  34.81  metres 
below  the  ocean  surface.  It  can  be  seen  that,  for  this  case,  the 
effects  of  the  radiation  of  sound  produce  more  significant  changes  in 
the  bubble's  motion  than  do  those  of  the  ccean  floor  and  surface, 
although  this  may  not  always  be  true.  It  should  also  be  noted  that  tl.2 
precise  characteristics  of  the  motion  seem  to  quite  sensitive  to  the 
initial  values  of  the  variables.  For  example,  if  one  uses  the  initial 
values  for  as  given  eq.  [3]  at  a  time  t*  «  .001,  integrates  eq.  [1]  and 
[2]  (i.e.  the  equations  of  motion  ignoring  bottom  and  surface  effects) 
to  a  time  t*  =  .01,  and  uses  the  values  obtained  as  starting  values  at 


t*  -  .01  in  the  integration  of  eqs.  [10]  and  [11],  (i.e.  the  equations 
of  motion  incorporating  surface  and  bottom  effects),  one  obtains  Figs. 
28-30,  which  exhibit  some  differences  from  Figs.  25-27,  notably  in  the 
values  obtained  for  the  upward  velocity  of  the  bubble  near  the  first 
minimum. 

V.  CONCLUDING  REMARKS.  In  this  work,  we  have  derived  a  more  precise 
expression  for  the  energy  loss  experienced  by  a  spherical  bubble 
through  the  radiation  of  sound,  and  shown  that  the  dissipative  force 
from  this  cause  is  proportional  to  the  cube  of  the  rate  of  change  of 
the  bubble  radius,  at  least  as  a  first  approximation.  We  have  also 
shown  how  the  effects  of  drag  may  be  combined  1th  those  of  radiation 
of  sound. 

We  have  presented  some  results  of  an  algorithm  which  solves  the 
various  forms  of  the  equations  of  motion.  We  suggest  that  the  form  of 
the  equations  of  motion  derived  in  section  II. 2  of  this  paper  are  more 
amenable  to  numerical  solution  than  those  derived  by  Taylor  (1942), 
especially  when  one  wishes  to  incorporate  the  effects  of  energy  loss, 
and  would  hence  recommend  their  employment. 

From  the  results  of  our  computations,  it  would  seem  that  one 
should  not  neglect  the  effects  of  internal  energy  and  energy  loss  from 
radatlon  and  drag  in  the  calculation  of  the  bubble's  behaviour,  but 
that  the  effects  of  free  and  rigid  surfaces  on  the  motion  are  usually 
of  lesser  significance. 

Finally,  since  the  non-sphericity  of  the  bubble  near  its  minimum 
radius  is  of  considerable  importance  to  the  motion,  th^  equations  of 
motion  should  be  extended  to  cover  this  case.  We  suggest  that  an 
approach  using  the  Lagrangian  formalism  might  have  oome  utility  in  this 
endeavour. 
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ABSTRACT.  We  discuss  an  adaptive  local  refinement  finite  element 
method  for  solving  initial-boundary  value  problems  for  vector  systems  of 
partial  differential  equations  in  one  space  dimension  and  time.  The  method 
uses  piecewise  bilinear  rectangular  space-time  finite  elements.  For  each  time 
step,  grids  are  automatically  added  to  regions  where  the  local  discretization 
error  is  estimated  as  being  larger  than  a  prescribed  tolerance.  We  discuss 
several  aspects  of  our  algorithm,  including  the  tree  structure  that  is  used  to 
represent  the  finite  element  solution  and  grids,  an  error  estimation  technique, 
and  initial  and  boundary  conditions  at  coarse-‘fine  mesh  interfaces.  We  also 
present  computational  results  for  a  simple  linear  hyperbolic  problem,  a 
problem  involving  Burgers'  equation,  and  a  model  combustion  problem. 


1.  INTRODUCTION.  There  is  an  ever  increasing  need  to  solve 
problems  of  greater  complexity  and  a  corresponding  need  for  reliable  and 
robust  software  tools  to  accurately  and  efficiently  describe  the  phenomena. 
Adaptive  techniques  are  good  candidates  for  providing  the  computational 
methods  and  codes  necessary  to  solve  some  of  these  difficult  problems.  Two 
popular  adaptive  techniques  are:  (i)  moving  mesh  methods,  where  a  grid  of  a 
fixed  number  of  finite  difference  cells  or  finite  elements  is  moved  so  as  to 
follow  and  resolve  local  nonuniformities  in  the  solution,  and  (ii)  local 
refinement  methods,  where  uniform  fine  grids  are  added  to  coarser  grids  in 
regions  where  the  solution  is  not  adequately  resolved.  A  representative 
sample  of  both  types  of  methods  is  contained  in  Babuska,  Chandra,  and 


1  The  authors  were  partially  supported  by  the  U.  S.  Air  Force  Office  of 
Scientific  Research,  Air  Force  Systems  Command,  USAF,  under  Grant  Number 
AFOSR  80-0192  and  the  U.  S.  Army  Reseurch  Office  under  Contract  Number 
DAAG29-82-K-0197. 
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Fhherty  [2].  Recently,  Adjerid  and  Flaherty  [1]  developed  a  finite  element  ! 

method  that  combines  r  esh  moving  and  refinement.  | 

Herein,  we  discuss  a  local  refinement  finite  element  procedure  for 
finding  numerical  solutions  of  M-dimensional  vector  systems  of  partial 
differentia!  equations  having  the  form 

Lu  :=  ut  +  f(x,t,u,ux)  -  [D(x,t,u)ux]x  =  0,  j 

a  <  x  <  b,  t  >  0,  (1.1)  .  | 

subject  to  the  initial  conditions  j 

u(x,0)  =  u°(x)  ,  a  <  x  <  b  ,  (1.2)  j 

j 

and  appropriate  boundary  conditions  so  that  the  problem  has  a  well  posed  I 

solution.  ! 

i 

We  discretize  (1.1,2)  for  a  time  step  using  a  finite  element-Galerkin 
procedure  with  piecewise  bilinear  approximations  on  a »  rectangular  space-time 
net.  At  the  end  of  each  time  step  we  estimate  the  local  discretization  error, 
add  finer  subgrids  of  space-time  elements  in  regions  of  high  error,  and 
recursively  solve  the  problem  again  in  these  regions.  The  process  terminates 
when  the  error  estimate  on  each  grid  is  less  than  a  prescribed  tolerance.  The 
original  coarse  space- time  grid  is  then  carried  forward  for  the  next  time  step 
and  the  strategy  is  reoeated.  Our  algorithm  is  discussed  further  in  Flaherty 
and  Moore  [9]  and  some  of  this  discussion  is  repeated  in  Section  2. 

Berger  [3]  used  a  similar  local  refinement  procedure  to  solve  one-and  ; 

two-dimensional  hyperbolic  systems.  She  used  explicit  finite  difference 
schemes  to  discretize  the  partial  differential  equations,  while  we  use  implicit 
finite  element  techniques  since  we  are  primarily  interested  in  paraboi-c  j 

problems.  j 

! 

In  addition  to  the  discretization  technique,  the  major  numerical 
questions  that  must  be  answered  as  part  of  the  development  of  a  local 
refinement  code  are  (i)  the  estimation  of  the  discretization  error  and  (ii)  the 
appropriate  initial  and  boundary  conditions  to  apply  3t  coarse-fine  mesh 
interfaces.  Of  course.  Computer  Science  questions,  such  as  which  language 
to  use  to  describe  and  implement  the  various  algorithms  and  what  data 

structures  to  use  to  represent  and  store  the  grids  and  solutions  must  also  be 

answered.  Gur  work  in  ail  of  these  areas  is  still  far  from  complete  and  herein 

we  only  discuss  our  progress  and  thoughts  on  error  estimation  techniques, 
data  structures,  and  interface  conditions  (cf.  Section  2).  In  Section  3,  we 
present  the  results  of  three  examples  that  illustrate  our  method  and  the  ! 

discussion  of  Section  2,  and  in  Section  4,  we  present  some  preliminary 

conclusions  and  future  plans. 

2.  FINITE  ELEMENT  ALGORITHM.  We  discretize  equation  (1.1)  on  a 
strip  a  <  x  <  8,  p<t<q  using  a  finite  element-Galerkin  method  with  a 
uniform  grid  of  N  rectangular  elements  of  size  (fs  -  o)/N  by  (q  -  p).  We 
refer  to  this  grid  as  R(cc,fS,p,q,N,f,s),  where  f  and  s  are  pointers  to  the 
father  and  son  grids  discussed  later.  Each  grid  uses  records  to  store  the 
appropriate  information. 
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We  generate  the  discrete  system  on  R(a,  (5,  p,q,  N,  f,  s)  in  the  usual 
manner;  thus,  we  approximate  u  by  U(x,t)  and  select  test  functions  V(x,t), 
where  U  and  V  are  elements  of  a  space  of  C°  bilinear  polynomials  with  respect 
to  the  grid  R.  We  then  take  the  inner  product  of  equation  (1.1)  and  V, 
replace  u  by  U,  and  integrate  any  diffusive  terms  by  parts  to  obtain 


/  [VTU 
R 


VTf(x,t,U,Ux)  ♦  VxD(x,t,U)Ux]dxdt 


R  x  6 

-/  V  D(x,t,U)U  |Jdt  =  0  .  (2.2) 

P 

Equation  (2.2)  must  vanish  for  all  bilinear  functions  V  on  the  grid  R.  The 
integrals  are  approximated  using  a  four  point  Gauss  quadrature  rule  and  the 
resulting  nonlinear  system  is  solved  by  Newton  iteration  (cf.,e.g.,  [7]  for 
additional  details).  Appropriate  initial  and  boundary  conditions  for  (2.2)  are 
discussed  later  in  this  section. 

We  describe  our  local  refinement  procedure  for  solving  problem  (1.1,2) 
for  one  time  step  (t 0 , t 1 )  on  a  coarse  grid  with  N°  elements,  i.e,  on 
R('a,b,t#,t1,N°,0,s)  (where  the  pointer  f  =  0  signifies  that  this  grid  has  no 
father).  To  solve  this  problem  we  simply  call  the  procedure  ''locref"  with  the 
arguments  R(a,b,t°,tl,N°,0,s) ,  tol,  tsub  for  each  coarse  grid  time  interval. 
A  pseudo-PASCAL  description  of  the  procedure  "locref"  is  shown  in  Figure  1. 


procedure  locref  (R(a,  P,  p,q,  N,f ,  s) ,  to!,  tsub) 
begin 

Solve  the  finite  element  equations  (2.2)  on  R(a,p,p,q,N,f,s); 

Estimate-  the  error  on  R(a,P,p,q,  N,f,s); 
if  error  >  tol  then 
begin 

calculate  where  error  >  tol  and  return  the  son  grids; 
for  j  :=  1  to  tsub  do 

for  i  :=  1  to  number  of  sons  do 
begin 

p[j]  :=  P  +  (j-1)*(q-p)/tsub; 

R[j]  :=  p[j]  +  (q-p)/tsub; 
locref  (R(a[i],3[ij,p[j],q[j],N[i], 
R(a,p,p,q,N,f,s),s[i],  tol,  tsub) 

end 

end 

end; 

Figure  1.  Algorithm  for  local  refinement  solution  of  (1.1,2)  on 
R(a,  p,p,q,N,f,s)  with  an  error  tolerance  of  tol  and  dividing  the  local  time 
step  by  tsub  each  time  the  error  test  is  not  satisfied. 


The  recursive  algorithm  locref  sets  up  a  tree  structure  of  grids  with 
R(a,b,t#,tl,Ns,0,s)  being  the  root  nods  and  with  the  solution  being 
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generated  by  a  preorder  traversal  of  the  tree  at  each  local  time  step.  For 
example,  «f  the  root  grid  is  refined  to  give  two  subgrids  and  the  time  step  is 
halved,  then  the  problem  is  solved  on  the  first  subgrid  on  its  first  time  step, 
then  on  the  second  subgr'd  on  the  same  time  step,  then  this  procedure  is 
repeated  for  the  second  time  step.  The  error  is  estimated  by  Richardson 
extrapolation,  i.e.,  the  space  and  time  steps  are  halved  and  the  problem  is 
solved  again  on  this  new  grid.  The  two  solutions  that  are  obtained  at  each 
original  grid  point  are  used  to  generate  an  error  estimate.  If  this  pointwise 
estimate  exceeds  the  tolerance  "to!",  finer  grids  are  added  as  leaf  nodes  to 
the  tree.  Tnis  procedure  is  similar  to  one  used  by  Berger  [3];  however, 
there  are  more  economical  error  estimation  strategies  (cf.,  e.g.,  Bieterman 
and  Babuska  [5,  6])  i';hich  we  a.  a  currently  investigating. 

In  order  to  solve  the  finite  element  system  (2.2)  we  need  to  supply 
initial  and  boundary  conditions.  On  any  grid  with  p  =  0,  a  =  a,  or  B  =  b 
these  can  oe  obtained  from  the  initial  condition  (1.2)  or  o  -escribed  boundary 
conditions.  However,  artificial  initial  and  boundary  conditions  must  be  created 
at  all  other  coarse-^ne  mesh  interfaces.  This  is  «  difficult  and  crucial 
problem  that  is  discussed  for  explicit  finite  difference  methods  by  Berger 
[3,  4];  however,  it  is  largely  unanswered  for  finite  element  applications. 
Instabilities  or  incc-rect  solutions  (cf.  Example  1  of  Section  3)  can  result  if 
inappropriate  conditions  are  specified. 

For  initial  conditions,  two  strategies  immediately  come  to  mind:  (i) 
saving  all  fine  grid  data  for  propagation  in  time  or  (ii)  interpolating  the  best 
coarse  grid  data  to  finer  grids.  We  consider  a  blend  of  the  two  strategies 
which  consists  of  saving  the  fine  grid  data  down  to  a  given  level  X  in  the 
tree  and  subsequently  interpolating  for  finer  grids.  F.ach  grid  in  the  first  X 
levels  either  has  a  iinked  list  of  the  initial  data  directly  associated  with  it  or 
uses  an  initial  data  list  of  an  ancestor  grid.  To  find  the  value  of  the  solution 
ac  some  new  «vtial  point,  the  coordinate  of  that  point  is  sequentially  compared 
to  value*  in  the  linked  list  until  an  interval  containing  the  point  is  found  so 
that  interpolation  can  be  used.  This  is  costly  ar.d  we  are  investigating  more 
efficient  procedures  that  use  the  natural  ordering  that  already  exists.  We 
used  either  piecewise  linear  interpolation  or  piecewise  parabolic  interpolation 
with  shape  preserving  splines  developed  by  McLaughlin  [10] .  For  each  grid 
in  the  first  X  levels  of  the  tree,  a  linked  list  is  created  to  store  the  initial 
data.  Wa  are  studying  several  alternative  ways  of  determining  a  proper  value 
for  X. 


At  the  present  time,  we  prescribe  internal  Dirichlet  boundary  conditions 
by  linearly  interpolating  from  coarse  to  finer  grids.  A  buffer  zone  of  two 
elements  is  added  to  each  end  of  regions  of  high  error  that  do  not  intersect 
the  boundaries  x  =  a  and  b.  If  two  buffer  zones  overlap  or  are  separated 
from  one  another  by  one  element,  the  two  grids  are  joined.  Similarly,  if  the 
buffer  is  only  one  element  away  from  either  a  or  b,  that  element  is  added  to 
the  grid. 


3.  NUMERICAL  EXAMPLES.  An  experimental  code  based  on  the 
algorithms  in  Section  2  has  been  written  in  FORTRAN-77.  We  are  testing  >t 
on  several  examples,  some  of  these  follow  and  others  are  presented  in  [9j . 
All  results  were  computed  in  double  precision  on  an  IBM  3081 D  computer. 


t  ft  {  * 


Example  1.  In  order  to  illustrate  the  importance  of  adequately  resolving 
initial  conditions  at  each  time  step  we  solve  the  linear  hyperbolic  initial  value 
problem 


u.  +  u  =  0  , 
t  x 


u(x,0)  =  u°(x)  = 


(1/2) (cos (20r(x -0.45))  -  1)  , 
0  ,  otherwise 


0.35  <  x  <  0.75 


We  solve  this  problem  for  one  coarse  time  step  of  At  =  0.05,  10  elements  on  0 
<  x  <  1,  tol  =  0.01.  For  small  enough  times  the  exact  solution  is  u#(x-t).  If 
initial  conditions  are  interpolated  from  the  coarse  to  the  fine  grid,  the 
oscillations  are  missed  and  an  incorrect  solution  is  computed,  possibly  without 
a  user  realizing  that  there  is  anything  wrong.  However,  saving  initial  values 
for  the  first  8  levels  of  the  tree  of  grids  calculates  the  correct  solution  to  the 
prescribed  accuracy.  The  incorrect  and  correct  solutions  are  shown  at 

t  =  0.05  in  Figure  2. 

Example  2.  We  solve  the  following  problem  for  Burgers’  equation: 

u.  +  uu  =  du  ,  0  <  x  <  1  ,  0  <  t  <  1  , 
t  x  xx 

u(x,0)  =  sinirx  ,  0  <  x  <  1  , 
u(0,t)  =  u ( 1 , t)  =  0  ,  t  >  0  . 

We  choose  d  =  C. 00003,  a  coarse  grid  of  10  elements  and  At  =  0.1,  and 

piecewise  parabolic  approximations  for  the  initial  conditions  with  X  =  6.  I c  is 
well  known,  that  the  solution  of  this  problem  is  a  "pulse"  that  steepens  as  it 
travels  to  the  right  until  it  forms  a  shock  layer  at  x  =  1.  After  a  time  of 

0(1/d)  the  pulse  dissipates  and  the  solution  decays  to  zero.  We  solve  this 

problem  for  tol  -  0.01  and  0.001  and  show  the  solutions  at  t  =  0.4  in  Figure 
3.  The  solution  with  the  cruder  tolerance  is  exhibiting  some  oscillations  that 
are  within  our  bounds.  These,  however,  are  not  visible  when  the  finer 
tolerance  is  used  to  solve  the  problem. 

Example  3.  We  solve  the  model  combustion  problem 
ut  +  ux  -  2eu  =  uxx  ,  0  <  x  <  1  ,  0  <  t  <  1  , 
u(x,0)  =  0  ,  u(0,t)=0  ,  ux(1.t)  =  0  . 

The  exponential  nonlinearity  is  typical  in  combustion  problems  having 
Arrhenius  chemical  kinetics.  However,  in  this  case  the  solution  develops  a 
"hot  spot"  at  x  =  1  and  becomes  infinite  when  t  is  approximately  0.85.  We 
choose  a  coarse  grid  of  20  elements  and  At  =  0.05,  tol  =  0.001,  and  X  =  6.  In 
Figure  4  we  show  the  computed  solution  U(x,t)  as  a  function  of  x  for  t  = 
0.05,  0.6,  and  0.8  and  in  Figure  5  we  show  the  mesh  that  was  used  to  solve 
the  problem.  We  see  that  the  mesh  is  initially  concentrated  in  the  region  near 
x  =  0  where  the  curvature  of  the  solution  is  largest.  As  time  progresses  and 
the  curvature  diminishes',  excessive  refinement  is  not  necessary.  Finally,  as 
the  solution  begins  to  "blow-up"  our  algorithm  generates  a  fine  mesh  only  in 
the  region  near  x  =  1 . 


4.  DISCUSSION  AND  CONCLUSIONS.  We  have  briefly  described  an 
adaptive  local  refinement  algorithm  for  solving  time  dependent  partial 
differential  equations.  Even  though  this  is  very  mjch  a  working  algorithm, 
and  not  a  production  code,  we  are  very  encouraged  by  the  preliminary 
results.  We  are  investigating  several  possible  ways  of  improving  the 
efficiency  and  robustness  of  ou.*  algorithm.  These  include  adding  higher 
order  polynomial  finite  element  approximations,  adaptively  changing  the 
number  of  elements  that  are  earned  forward  in  the  coarse  grid  at  each  coarse 
time  step,  how  to  select  the  appropriate  buffer  length,  adaptively  determining 
the  optima!  number  of  levels  of  initital  conditions  to  keep  at  coarse-fine 
interfaces,  and  the  best  boundary  conditions  to  apply  at  internal  boundaries. 
We  are  encouraged  by  the  performance  of  McLaughlin’s  [10]  shape  preserving 
parabolic  splines;  however,  the  entire  area  of  interpolating  from  coarse  to 
fine  grids  needs  further  study.  We  are  also  developing  non-Dirichlet 
'‘natural"  boundary  conditions  to  use  at  coarse-fine  mesh  interfaces. 

Finally,  we  are  very  interested  in  combining  the  moving  mesh  strategy 
of,  e.g.,  [7,  8]  with  the  present  local  refinement  strategy  and  extending  our 
methods  to  two  and  three  dimensions. 
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Post-Buckling  Analysis  of  an  Elastica  with 
with  One  and  Many  Critical  Loads 


Iradj  3.  Tadjbakhsh* 


Abstract 

Static  stability  of  an  elastic  system  composed  of  a  flexible  and 
a  rigid  link  leads  to  a  nonlinear  eigenvalue  problem  with  a  single 
buckling  load  or  infinite  buckling  loads  when  the  flexible  link  is  in 
tension  or  compression,  respectively,  integration  of  equations  of 
equilibrium  leads  to  nonlinear  singular  integral  equations  which  can 
be  analyzed  and  solved  numerically. 
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Introduction 


Among  the  problems  arising  from  consideration  of  stability  of 
elastic  rods  under  axial  loads  is  the  class  of  nonlinear  eigenvalue 
problems.  This  type  of  problems  describe  the  buckling  loads  of  the 
rod,  bifurdation  of  the  solutions  at  critical  stability  conditions 
and  their  post  buckling  behavior  [1,2]. 

A  particular  case  is  the  buckling  of  a  straight  column,  fixed  at 
the  base  and  acted  upon  by  a  rigid  crank  moving  up  and  down  at  an  end 
along  the  column  zeds  and  hinged  at  the  other  end  to  the  free  tip  of 
the  column.  This  state  of  affairs  produce  alternately  states  of 
tension  and  compression  in  the  column.  For  the  upward  motion  of  the 
crank.  Fig.  1,  the  column  is  in  tension  and,  yet,  it  will  buckle 
under  a  single  critical  load.  By  contrast,  tho  downward  motion  of 
the  crank  is  characterized  by  the  column  being  in  compression  and  by 
infinitely  many  critical  loads. 

Integrations  of  the  equations  of  equilibrium  lead  to  singular 
nonlinear  integral  equations  which  can  be  solved  numerically  after 
appropriate  limits  are  obtained.  Results  indicate  that  postbuckling 
behavior  is  characterized  by  gradual  decrease  in  the  load  carrying 
capacity  of  the  system  as  deflections  grow.  This  is  the  type  of 
behavior  which  is  indicative  of  sensitivity  to  imperfections  in 
column  geometry  or  material  [3]. 
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Poraulatior  and  Solution 

Referring  to  Pig.  1,  an  elastic  rod  BC  in  considered.  The  rod 
i3  fixed  at  C  and  alternately  in  tension  or  compression  as  the  rigid 
rod  AB  is  pushed  upward  or  downward  along  the  axis  AC.  Per  load  P 
less  than  a  critical  value  the  elastica  will  be  in  an  unbuckled  state 
and  lying  along  the  x-axis  The  critical  value  of  P  and  the 
poet  -buckling  behavior  of  the  elastic  rod  will  be  different  for  the 
two  states  of  loading. 

Considering  first  the  case  of  upward  motion  of  hinge  A,  one  may 
expreaa  the  axial  force  T  and  the  transverse  shear  N  in  the  elastica 
by 


T  cosd  -  K  sind  ■  P 


1*  sind  +  N  cosd  ■«  p  tanl 


U) 


Also  noting  that  N  «  -El  d ",  we  obtain  after  eliminating  T 


El  d"  +  p  (tand  cosd  -  sind)  -0,  o  <  s  <  1,  ('»  d/ds) 


(2) 
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The  relation  between  the  deflection  y(s)  and  d  is 
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y  -  ain0(c)ds 
and  furthermore 


(3) 


2  2  1/2 

tan£-y(i)/[d  -y  (1)3  '  - 


(*> 


where  q  jus  the  length  of  AB.  The  boundary  conditions  that  accceypany 
(2)  are 


6(0)  -  0’( A)  -  0  (5) 

During  the  downward  motion  oi  A,  again  acted  upon  by  a  force  P 
which  is  considered  positive,  (3)  -  (5)  remain  the  same  but  (2) 
changes  and  becomes 

El  0"  +  P  (tan>8  co»0  +  3in0)  »  0  (6) 

The  linearized  version  of  the  problem  defined  by  (2)  -  (5)  has 

only  a  single  eigenvalue  and  corresponding  eigenfunction.  By 


te 


contrast  when  ( 6 )  is  considered  in  place  of  ( 2 )  the  problem  possesses 
infinitely  many  eigenvalues  and  eigsnf unctions .  The  post-buckling 
regime  is  characterized  by  a  gradual  decrease  in  the  value  of  P  as 
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th®  amplitude  of  deflection 


grow*. 


Let 


e  »  #  «  s/i,  r  -  i/d,  *<*,€)  «  <J/€ 

a<«)  -  7<*)/i,  «<€)  «  iN/^i 


(7 


«i*n  { 2 )  -  (5)  yield 

+  (a2/«)  (tiuLS  coa«4>-sine6)  •  o,  o  <  f  <  i 
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additionally 

1 

®  ”  /  sin~(*  d£ 

o  (11 

tan^  «  rO/  (i~x20Z)1/2 


on  physical  grounds  or*  may  expect  that  the  load  should  be  an 
even  function  of  amplitude  €  and  that  the  deflection  should  be  odd  in 


Eliminating  a  between  (.27)  and  (23)  yields 


(29) 


which  determines  -5  as  a  function  of  0  and  e.  Substituting  (29)  into 
the  right  side  of  (12)  produces  an  implicit  equation  determining  /?  a* 
a  function  of  €  for  fixed  r.  Ones  /3(e)  is  found,  (29)  determines 
0(€)  and  subsequently  (28)  will  yield  a(€). 

In  calculating  the  integrals  that  occur  in  (27)  -  (29) 

singularities  are  encountered  for  (i)  $  «  1,  ( ii)  €  -  0  and 

(iii)  t&nfi  m  oa.  Briefly  we  describe  how  in  each  case  the 

sin.Tularities  may  be  remoe  d. 

(i)  Consider  the  integral  in  (27)  and  note  that  by  using 

binoznial  expansion  we  have 

i/e  -  i/g'  *  a  (30) 

where 

*  1/2 

Q  m  [e  (ta a.6  cosc-sin€ )( 1—^)3  (31) 

1/2  * 

and  R  -  [0  (l-$)  ].  Now  l/Q  is  integrable  analytically  and  R 

vanishes  at  $  -  l.  The  integration  in  (27)  can  proceed  with  the  aid 
of  decomposition  indicated  in  ( 30 ) .  A  further  removable  singularity 


in  integral  of  (27)  occurs  when  RJ 


is  considered:  for  €  »  0.  Kow 


$"•0 


—1/2  —1/2 
R|^0  “  (tan£  sin£+cos€-l)  -[(tan/9cos€“Sin€)€  j 


which  becomes-  indefinite  of  the  fora  »  -  as  €  -•  0.  A  Halting 
process  shows  that  lia.  R| , _ -  o. 

9*0 

(ii)  the  limit  of  e  »  0  reduces  the  nonlinear  problem  to  the 
linearised  problen  described  sarlier .  This  is  accomplished  if  we 
obtain  limit  of  a  from  (27)  which  is 

ao  *  (2)"1/2  /1{(l-*)[r0o-(l+«>>/23)’1/2  d*  (32) 

o 


Similar  limiting  process  when  applied  to  (29)  produces 

0Q  -  (i /6ao)  /  ♦  {(l-^)Cr0o-dH)/2]}^1/2  d*  (33) 

o 


Carrying  out  the  integrations  and  solving  (32)  -  (33)  for  aQ  and  0Q 
the  results  in  (18)  and  (20)  are  obtained. 

(iii)  This  is  the  limit  when  the  rigid  rod  AS  is  in  a  horizontal 
position  .in  its  up  and  down  motion.  Corresponding  to  this 
situation  tan/9  »  «*>,  a  -  0,  fl  -  1/r  and  €  tends  to  a  limit  which  will 


be  denoted  by  €.  Since 


1/2  -1  '  1/2 
lia  (tan/3)  Q  *  q<  €,$)-( sine  -  sin€$) 

tan£-*«» 


it  follows  from  (29)  that 


r  -  /  q<  €,$)d$/Jq(  €<j>)sin€$.d$ 


This  relation  determines  the  inclination  9(1)  »  e  for  the 
position  tan/S  » 

Pig .  2  shows  the  result  of  numerical  determinations  giving  or  as 

a  function  of  0  for  various  values  of  r.  The  results  for  a  >  o 
pertain  to  the  upward  stroke  of  the  hinge  A  and  those  for  a  <  0  show 
the  load-deflection  relationships  when  A  moves  downward.  Por  column 
in  compression  only  the  first  buckling  loads  are  shown.  The  collate 
picture  includes  points  obtained  by  symmetric  reflection  about 
the  a-axis . 
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ABSTRACT.  We  discuss  an  adaptive  mesh  moving  technique  that 
can  be  used  with  a  finite  difference  or  finite  element  scheme  to  solve 
initial-boundary  value  problems  for  vector  systems  of  partial  differential 
equations  in  two  space  dimensions  and  time.  The  mesh  moving 
technique  is  based  on  an  algebraic  node  movement  function  determined 
from  the  propagation  of  significant  error  regions.  The  algorithm  is 
designed  to  be  flexible,  so  chat  it  can  be  used  with  many  existing 

finite  difference  and  finite  element  methods.  To  test  the  mesh  moving 
algorithm,  we  implemented  it  in  a  system  code  with  an  initial  mesh 
generator  and  a  MacCormack  finite  volume  scheme  on  quadralateral  cells 
for  hyperbolic  vector  systems.  Results  are  presented  for  several 
computational  examples.  The  moving  mesh  scheme  reduces  dispersion 

errors  near  shocks  and  wave  fronts  and  thereby  reduces  the  grid 
requirements  necessary  to  compute  accurate  solutions  while  increasing 
computational  efficiency. 

1.  INTRODUCTION.  Mesh  moving  an  adaptive  technique  that 
has  been  used  successfully  to  improve  the  accuracy  of  both  finite 

element  and  finite  difference  schemes  for  a  variety  of  time  dependent 

problems  in  one  space  dimension  (cf.,  e.g.,  [1,2,10,11,14,18,20,23]). 
The  essential  idea  is  to  derive  equations  so  that  the  mesh  moves  either 
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to  extremize  some  quantity,  e.g.,  to  minimize  the  discretization  error, 
or  to  follow  some  local  nonuniformity,  e.g.,  a  wave  front.  This 
generally  reduces  dispersive  errors  and  allows  the  use  of  larger  time 
steps  while  maintaining  accuracy.  For  example,  with  a  fixed  mesh  a 
wave  front  may  move  through  a  cell  in  one  time  step  causing 
significant  dispersive  errors  (cf.  Figure  la);  whereas,  a  moving  rnesh 
with  the  same  time  step  can  follow  the  wave  front  and  keep  it  within 
the  same  cell  (cf.  Figure  Ibl 


Uave  Front  Trajectory 


Wave  Front  Trajectory 


T<r-1 


Figure  1.  Wave  Front  Trajectory  on  a)  Stationary  Mesh  (top)  and  b) 

Moving  Mesh  (bottom). 


Mesh  moving  algorithms  have  often  been  related  to  the  numerical 
integration  scheme  and/or  the  problem  being  solved.  In  one  dimension, 
for  example,  Hyman  [19]  moves  a  mesh  to  minimise  the  time  variation 
of  the  iolution  ax  the  nodes.  This  scheme  uses  finite  difference 
approximations  for  solving  hyperbolic  conservation  laws.  Davis  and 
Flaherty  [9]  develop  a  finite  element  code  for  parabolic  systems  that 
moves  the  mesh  so  as  to  equidistribute  the  spatial  component  of  the 
discretization  error.  Miller  et  al.  [15,20,2'i]  couple  the  node  position 
equations  into  the  finite  element  variational  equations  and  min'mize  the 
residual  in  solving,  parabolic  problems.  Bel!  and  Shubin  [21  solve  the 
Euler-Lagrsnge  equations  of  an  extromizing  functional  and  use  a  finite 
difference  scheme  to  solve  hyperbolic  conservation  laws.  AH  of  these 
schemes  have  successfully  demonstrated  that  mesh  moving  can  red  tee 
error  and  provide,  improvements  in  computational  efficiency  for 
one-dimensionai  problems. 
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With  some  modification  the  Hyman  and  Miller  algorithms  can  be 
extended  to  higher  dimensions;  however,  many  other  mesh  moving 
techniques  are  not  directly  applicable  to  two-  and  three-dimensional 
problems.  One  difficulty  is  that  equidistribution  strategies  fail  to 
produce  unique  solu+ions.  Brackbill  and  Saltzman  [7,26]  have  overcome 
this  problem  by  adding  the  constraints  of  mesh  smoothness  and 
orthogonality  to  a  variational  problem. 

A  sucessful  mesh  moving  scheme  for  higher  dimensional  problems 
that  is  somewhat  simile  to  the  method  presented  here  is  the  algorithm 
of  Rai  and  Andei  [23,24,25].  Their  algorithm  is  based  on  a 
gravitional  principle  and  calculates  the  velocity  of  a  node  based  on  the 
difference  between  its  error  and  the  mean  error.  The  displacement  of 
one  node  with  respect  to  another  is  inversely  proportional  to  the 
distance  between  them.  A  summation  over  all  nodes  is  necessary  to 
determine  each  node's  speed  in  a  computational  grid. 

A  different  adaptive  technique  is  local  mesh  refinement  which 
consists  of  dividing  or  refining  elements  in  regions  where  the  solution 
is  not  adequately  resolved.  The  advantage  of  this  technique  relative  to 
mesh  moving  is  that  enough  fine  grids  can  he  added  to  resolve  the 

small  scale  structures  of  the  solution  and  provide  solutions  to  within 
user  prescribed  error  tolerances.  The  local  mesh  refinement  schemes  of 
Berger  [3,4],  Flaherty  and  Moore  [13],  Gannon  [16]  and  Bieterman 
and  Babuska  [5,6],  have  successfully  satisfied  user  tolerances  for 
different  problems  using  finite  element  or  finite  difference  schemes  in 
either  one  or  two  dimensions. 

The  most  promising  algorithms  appear  to  be  those  that  combine 

both  mesh  moving  and  local  mesh  refinement.  While  neither  adaptive 
technique  or  their  combination  is  likely  to  be  optimal  for  most 

problems,  a  combination  can  accurately  solve  for  the  solution  in  regions 
where  it  varies  rapid!/  and  devote  little  effort  in  regions  where  it 
varies  slowly.  It  is  our  intention  to  consider  such  schemes;  however, 
the  computational  procedures  discussed  here  do  not  as  yet  contain  local 
refinement. 

The  mesh  moving  scheme  we  have  developed  is  simple,  efficient, 
and  independent  of  the  numerical  method  being  employed  to  discretize 
the  partial  differential  equations.  At  each  time  step  it  uses  the  current 
node  locations  and  the  nodal  values  of  a  mesh  movement  indicator.  We 
use  local  error  estimates  as  mesh  movement  indicators,  but  other 

computable  values  such  as  solution  gradients  or  curvature  could  be 
used.  Nodes  with  "statistically  significant  error"  (cf.  Section  2)  are 
grouped  into  rectangular  error  clusters.  This  clustering  separates 
soatially  distinct  phenomena  of  the  solution.  As  time  evolves  the 
clusters  can  move,  change  size,  change  orientation,  collide,  separate, 
reflect  off  boundaries,  or  pass  through  boundaries.  At  each  time  step 
new  clusters  can  be  created,  and  old  ones  can  vanish.  The  clustering 
algorithms  we  use  are  briefly  described  in  Section  2  and  were 
developed  by  Berger  [3,4]  for  a  mesh  refinement  scheme  for  solving 
hyperbolic  problems. 


Mesh  movement  is  determined  by  the  node's  relationship  to  its 
nearest  error  cluster.  Movement  is  done  in  two  steps,  each  in  a 
direction  along  a  principal  axis  of  a  cluster  rectangle.  The  amount  of 
movement  in  each  direction  is  determined  by  a  movement  function  which 
insures  that  the  center  of  error  of  the  cluster  moves  according  to  a 
differentia!  equation  suggested  by  Coyle  et  al.  [8].  Additionally,  the 
movement  function  smoothes  mesh  motion,  reduces  distortion,  mesh 
tangling,  or  overlapping,  and  prevents  nodes  from  moving  outside  the 
domain  boundaries. 

In  Section  2  we  discuss  error  clustering,  movement  of  the  center 
of  mass  of  the  error  cluster,  the  node  movement  function,  and  the 
■nitial  mesh  generator  used  in  the  computational  examples.  In  Section  3 
we  discuss  the  MacCormack  finite  volume  scheme  for  hyperbolic 
equations  and  the  error  estimates  used  in  the  computational  examples. 
The  results  of  the  computational  examples  are  given  in  Section  4,  and 
Section  5  contains  a  discussion  of  the  results  of  the  experiments  and 
the  status  of  our  algorithm. 

2.  MESH  MOVING  SCHEME  AND  INITIAL  MESH  GENERATION.  We 
suppose  that  an  approximate  solution  of  the  partial  differential  equation 
and  a  poir.twise  error  estimate  have  been  calculated  by  some  numerical 
technique  at  the  current  time  step,  we  then  flag  "significantly  high 
error  nodes"  as  nodes  with  error  greater  than  twice  the  mean  nodal 
error  and  also  greater  than  a  user  supplied  error  tolerance.  If  there 
are  no  significant  error  nodes,  computation  is  performed  on  a 
stationary  mesh.  Next  the  nearest  neighbor  clustering  algorithm  of 
Berger  [3,4]  is  used  to  duster  flagged  error  nodes.  The  nearest 
neighbor  clusters  have  internodal  distances  in  the  cluster  less  than 

intercluster  distances,  which  are  the  minimum  distances  between 
clusters.  The  formation  of  a  cluster  is  done  iteratively  by  starting 

with  a  nods  and  including  nodes  in  a  cluster  if  the  distance  from  the 
node  to  the  cluster  is  less  than  a  specified  distance.  When  a  node  is 
determined  to  belong  to  two  or  more  clusters,  those  clusters  are 
merged. 

Berger  [3,4]  shows  that  near  minimum  area  rectangles  that 

contain  all  the  nodes  within  the  cluster  can  be  easily  generated.  The 

principal  axes  of  such  a  rectsngie  are  the  major  and  minor  axes  of  an 

enclosed  ellipse  with  the  same  first  and  second  moments  as  the 

clustered  nodes.  Thus,  if  x  and  y  are  the  mean  coordinates  of  the 

m  m 

clustered  nodes,  then  the  axes  of  the  rectangle  are  the  eigenvectors  of 
he  symmetric  (2x2)  matrix 


For  problems  with  significant  error  nodes  located  on  a  long 
curved  line,  the  entire  region  will  belong  to  one  unacceptably  large 
cluster,  in  order  to  prevent  this  inefficieny  and  provide  better 
alignment  with  curved  fronts,  the  rectangular  clusters  are  checked  for 
efficiency  by  determining  the  percentage  of  flagged  nodes  in  th ' 
cluster  to  the  total  nodes  in  the  cluster.  If  a  50  percent  efficiency 
not  achieved,  the  rectangle  is  iteratively  bisected  in  the  direction  of 
the  major  axis.  This  is  repeated  until  all  clusters  have  a  50  percent 
efficiency  or  more.  This  nearest  neighbor  clustering  separates  spatially 
distinct  phenomena  as  shown  by  the  dotted  clusters  on  the  two 
dimensional  mesh  of  Figure  2  and  provides  some  linear  alignment  with 
long  curved  gradient  fronts  as  shown  by  the  clusters  in  Figure  3. 


Figure  2.  Two  spatially  distinct  Clusters. 


In  order  to  determine  proper  nods  movement,  as  shown  for  a  one 
dimensional  problem  in  Figure  lb.  the  speed  of  propagation  of  the 
error  clusters  must  be  determined.  Hyman  [19]  and  Hyman  and 
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Figure  3.  Clusters  aligned  with  a  Curved  Front. 

Harten  [18]  move  nodes  hasad  on  minimizing  the  time  variation  of  the 
solution  components.  For  hyperbolic  systems  this  allows  the  mesh  to 
move  at  a  weighted  average  of  the  characteristic  speeds  at  the  node. 
Front  tracking  schemes  also  move  the  "mesh  so  that  isolated 
discontinuities  are  stationary  in  reference  to  the  mesh.  Since  tracking 
error  is  possible  for  all  time  dependent  problems,  cur  cpproch  is  more 
general  and  is  also  an  approximation  to  these  schemes  for  hyperbolic 

problems,  where  error  propagates  in  a  characteristic  direction.  We 
assume  that  nodes  in  the  same  cluster  have  related  solution 
characteristics,  so  that  we  can  determine  individual  node  movement 
from  the  propagation  cf  the  center  of  mass  of  the  error  cluster. 

in  the  Hyman  and  Harten  algorithm  [18],  when  there  is  multiple 
wave  interaction  in  a  vector  system,  the  best  that  can  be  done  is  to 
move  the  mesh  with  a  weighted  average  of  characteristic  velocities.  The 
same  principle  applies  to  our  algorithm  when  muitiple  error  clusters 

have  merged,  the  mesh  is  still  able  to  move  as  their  combined  error 

cluster  moves,  which  is  a  form  of  weighted  averaging.  Comparisons 

between  center  of  mass  propagation  and  characteristic  paths  for  an 
example  problem  are  made  in  Section  4. 

V/e  attempted  to  move  nodes  based  o  a  procedure  that  was 
based  on  extrapolating  for  the  center  of  masses;  however,  chi;  showed 
an  unstable  oscillatory  effect,  indeed,  Coyle  et  a!.  [8]  showed  that 
node  movement  based  u-i  extrapolation  can  be  unstable.  Using  their 
suggestion,  we  stablize  the  movement  by  solving  the  differential 
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equation 


r  ♦  X  r  =  0  ,  (2.2) 

where  r(t)  is  the  position  vector  of  the  center  of  mass  of  an  error 

cluster  and  (  )  :=  d(  )/dt.  Equation  (2.2)  is  conditionally  stable,  and 
when  solved  numerically  with  reasonable  choices  of  X  >  0  the 

oscillations  in  the  mesh  movement  were  no  longer  present. 

We  solve  (2.2)  from,  say,  t  to  t  and  hence,  determine 

n  r*  I 

r(tn+^)  and  the  vector  r(*n  +  -j)  "  r(tn)  which  is  projected  in  the  two 

axial  directions  of  the  rectangular  cluster  to  determine  the  maximum 
movement  (MM)  in  each  axial  direction.  Along  the  two  axial  directions 
the  movement  function  is  one  dimensic<  I-  A  profile  of  the  movement 
function  that  we  use  is  shown  in  Figure  4;  however,  :he  algorithm  is 
designed  to  be  usea  with  any  general  one  dimensional  movement 
function.  Slopes  a  and  b  depend  on  distance  from  the  cluster  to  the 
domain  boundaries  and  other  clusters. 


aovea»nt  in 
axial  direction 


distance  fro« 

con 


Figure  4.  Profile  of  the  Node  Movement  Function. 


As  shown  in  Figure  4,  we  let 

iMM(l-ax)  if  nodes  are  x  distance  ahead  of  leading  edge 

MAj  .  .  =  jMM  if  nodes  are  inside  the  cluster  projection  (2.3) 

m  1  (MM(l-bx)  if  nodes  are  x  distance  behind  trailing  edge  . 

Equction  (2.3)  determines  the  movement  distance  for  nodes  inside  the 

projection  of  the  cluster  on  the  axis,  i.e.,  the  shaded  region  of 
Figure  5.  In  order  to  provide  smooth  node  movement  throughout  the 
domain,  nodes  outside  this  :egion  move  in  a  reduced  amount  as 
determined  by 
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(2.4) 


where  z  is  the  distance  outside  the  cluster  projection  as  shown  in 
Figure  5  and  01AM  is  the  diameter  of  the  Domain. 


node 


movement 

direction 


Cluster 


Figure  5.  Nodes  outside  the  Projection  of  the  Error  Cluster. 


The  generation  of  a  proper  initial  mesh  is  critical  to  the  success 
of  the  mesh  moving  scheme.  Without  refinement  the  mesh  moving 
algorithm  can  not  provide  suitable  errcr  control  unless  the  initial  mesh 
spacing  properly  resolves  initial  data.  An  initial  error  measure 
appropriate  for  the  finite  volume  scheme  on  quadralateral  cells  of 
Section  3  is  the  error  in  interpolating  the  prescribed  initial  condition 
uQ(x,y)  on  each  cell  by  a  biiinear  polynomial.  The  error  on  each  cell 

is  determined  as  the  difference  between  the  value  of  the  initial 
function  and  its  bilinear  ir.terpoiant  at  the  center  of  each  cell. 
Therefore,  the  initial  mesh  must  be  generated  so  that  the  condition 

n/4{u0(xj,yi)*u0(x.,yj.)+u0(xk,yk)+u0(x1,y1)}-u0(x,y)l  <  TOL  (2.5) 

holds  on  each  cell  when  using  the  vertex  and  center  point  labelling  as 
shown  *or  a  general  cel!  in  Figure  6.  TOL  is  a  user  supplied  error 
tolerance.  We  used  the  following  iterative  scheme  to  satisfy  condition 
(2.5)  for  the  computational  examples  of  Section  4: 

1.  Input  domain  boundaries  and  initial  data  function. 

2.  Generate  a  uniform  mesh. 


1  <  T  •  vw.  ■* 


m  j  ^  ^  ^  j ^  V  K*«**^’J  1TJ  V^VT.'  Vi.  a^J  VTiV^i  «r;’V*J 
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3.  Compute  cell  error  from  the  left  hand  side  of  (.2.5). 

4.  Cluster  high  error  nodes  and  move  influenced  nodes  toward 
center  of  clusters  according  to  (2.3)  and  (2.4). 

5.  Smooth  the  mesh  by  solving  the  Euler-Lagrange  equations  of 
Brackbii!  and  Saltzman  [7]. 

6.  Recompute  error  on  _s!ls. 

Repeat 

7.1.  Add  a  mesh  row  and  column  to  divide  cells  with  error 
greater  than  TOL 

7.2.  Smooth  the  mesh  by  the  algorithm  of  Brackbii!  and 
Saltzman  [7] 

7.3.  Recompute  the  error 

Until  the  error  tolerance  condition  (2.5)  is  satisfied. 

Initial  meshes  generated  with  this  algorithm  are  shown  in  Figures 
2  and  8  for  the  initial  condition  functions  (4.6)  and  (4.2)  respectively . 

Oic.Yk) 


Figure  6.  Cell  Labelling  for  Equation  (2.5). 

3.  MacCORMACK  FINITE  VOLUME  SOLVER  AND  ERROR 
ESTIMATION.  In  order  to  test  our  mesh  moving  scheme,  we  used  the 
explicit  finite  volume  MacCormack  scheme  on  nonuniform  quadraiateral 
grids  for  hyperbolic  vector  systems  of  the  form 

ut  "  fxte.y,u,t)  *  g  (x,y,u,t)  -  o,  (3.D 

u(x,y,0)  =  uQ(x,y),  (3.2) 

with  appropriate  well  posed  boundary  conditions. 

We  index  the  nodes  in  a  logically  rectangular  fashion  where  the 
time  dependent  node  locations  have  cartesian  coordinates 

(x.  .(t),y.  .(t))  and  therefo-e,  the  time  derivative  of 
*  /  J  '  /  j 

u(x.  ;(t),y.  - (t) , t)  is  given  by 
*  •  )  •  /  J 
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(3.3) 


’-r-'yv yr- 'r-  *■- trzBssrzwxsw?  yrj vtj vwj ma ktt y., irusv^rj ttvc.F2  v* ^.y-,.. 


u  =  u  x.  .  +  u 

x  i/j  y 

At  time  step  n,  we  approximate 


u 


t  ‘ 


where  xP . 

i/J 

step. 


i.i'V  * 

Ax.  .(t  )/At.  “ 
i/j  r\  n 

(xnr>  - 

i/j 

xP  .)/At 
i/J  n 

(3.4) 

i.i'V  ■ 

r  n  +  T 

(Yi,j  ' 

yP  •)/ At 

xi,J  n 

(3.5) 

Xi,j(V' 

yf,j  =  yi,j(V' 

and  At 

n 

is  the 

current  time 

The  finite  volume  scheme  is  obtained  by  integrating  Equation 

(3.1)  over  each  cell,  where  the  general  cell(i,j)  with  center  (x.  -,y.  .) 

I  /  J  I  r  J 

is  shown  in  Figure  7.  The  area  integrals  involving  the  spatial 
derivat'ves  of  f  and  g  are  converted  to  line  integrals  of  f  and  g 
around  the  cell  boundaries  using  Green's  Theorem.  The  integral  of  the 
time  derivative  term  over  celt (i , j )  is  approximated  by 

//u.dxdy  =  AP.  u  (3.6) 

X  I  /  J  X 

where  AP  .  is  the  area  of  cell (i , j)  at  tirr.estep  n.  The  line  integrals  are 
1  /  J 

approximated  by  using  values  of  the  solution  at  nodes  on  appropriate 
sides  of  the  cell  boundaries.  The  predictor  step  of  the  MacCormack 
scheme  uses  node  valuer,  to  the  left  and  below  the  boundaries,  while 
the  corrector  uses  node  values  to  the  right  and  above  the  boundaries. 

After  substitution  of  (3.4),  (3.5),  (3.6),  and  the  appropriate 

line  integral  approximations  into  Equation  (3.1)  and  using 


F(uP.) 

=  f(uP.) 

♦  (Ax.  .(t  )/At  )uP . 

(3.7) 

i/J 

i/J 

i/j  n  n  i  / 1 

G(uP  .) 
i/J 

=  g(uP  .) 

t,j' 

+  (Ay.  .(t  )/At  )uP . 
xi,j  n  n  i,j 

(3.8) 

where  uPj  is  the  calculated  approximation  to  Uj  j(tp)  to  simplify  the 

function  evaluations,  the  predictor  and  corrector  Equations  (3.9)  and 
(3.10)  for  the  MacCormack  finite  volume  scheme  on  a  moving  mesh  are 
obtained. 
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The  predictor  step  is 
rn+1  _  .  n  A.  . 


Vj  =  Vj  '  Atn/AU  {  F(ur,j)  (yi+1/2,j+1/2  "  yi*1/2/j-l/2) 
F(uM,j)  (yi-1/2,j*1/2  "  yi-l/2,j-1/2)  +  F(uU)  (yi-1/2,j+1/2 

'  F(uU-1)  (yi-1/2,j-1/2  ‘  yi+1/2,j-1/2)  '  G(ui!j) 

(xi+1/2,j+1/2  ‘  xi+1/2,j-1/2)  +  G(uM,j)  (xi-1/2,j+1/2  '  xi-l/2,j-l/2) 

G(Vj)  (xi-1/2,j+1/2  '  xi+l/2,j+1/2)  *  G(ui!j-1)  (xi-1/2,j-1/2 


Vl/2,j-1/2)}* 


(3.9) 


The  corrector  step  is 


n+1  n  .  “n+l  ..  /An  r  r-/“n+1  ^  , 

ui,i  -  1/2  ‘  Ui,j  Ui,j  ‘  VAi,j  t  F(Vl,j>  (yi*1/2,j*l/2  ' 

yi*l/2,j-1/2)  "  F<“U>  (yi-l/2,;*1/2  ’  yi-l/2,j-1/2)  '  F(uU*1) 

(yi-1/2,j*1/2  ‘  V1/2J+1/2*  "  F(ui,j  ]  CyM/2,j-1/2  “  yi^1/2,j-l/2) 

G(uh1l,j)  'Xi+1/2,j*1/2  '  xi+l/2,j-l/2)  +  G(u"j1)  (xi-1/2,j*1/2 

~xi-l/2,j-l/2)  "  G<VjV.  (xi-1/2,j*1/2  ’  xi*1/2,j*1/25  *  G(ui!j1) 


(xi-1/2,j-1/2  "  Vl/2,j-1/2^}- 


(3.10) 


On  a  stationary  rectangular  mesh,  this  scheme  reduces  to  the 
standard  MacCormack  finite  difference  scheme,  which  when  the 
predictor  and  corrector  are  combined  for  a  linear  partial  differential 
equation  is  the  same  as  the  Lax-Wendroff  finite  difference  scheme. 

Accurate  error  estimation  is  important  to  insure  that  user 
tolerances  are  achieved  and  to  refine  proper  regions  when  doing  local 
mesh  refinement.  However,  mesh  moving  is  not  as  sensitive  to  error 
estimation.  As  long  as  the  error  estimator  shows  the  error  propagation, 
proper  error  magnitudes  are  not  necessary.  Therefore,  in  the 
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Figure  8.  Initial  Mesh  for  Example  4.1. 


The  exact  solution  of  this  problem  is 

u(x,y,t)  =/0,  if  C  <  0  (4.4) 

1.C,  if  C  >  0, 

C  =  l-16((xcost+ysint-l/2)^  +  1 .5(ycost-xsint)~) .  (4.5) 

Equations  (4.4)  and  (4.5)  represent  a  moving  elliptical  cone  rotating 
counterclockwise  around  the  origin  with  period  2v.  It  was  proposed  as 
a  test  problem  by  Gottlieb  and  Orszag  [17]  and  we  selected  it  because 
the  rotational  quality  of  the  the  error  region  is  a  good  test  of  a  mesh 
moving  scheme. 

The  initial  m®«h  generated  for  this  problem  is  shown  in  Figure 
8.  This  mesh  has  an  initial  interpolation  error  less  than  0.08.  Figure  9 
shows  the  mesh  at  t  =  1.6,  and  Figure  10  shows  the  mesh  at  t  =  3.2, 
The  nodes  follow  the  moving  cone  to  keep  it  within  the  refined  region. 
The  dashed  lines  on  Figures  8,  9,  and  10  are  the  error  cluster 

rectangles  at  the  appropriate  time  steps.  Figures  11  and  12  show  the 
contour  and  surface  plot  of  the  solution  at  t  =  3.2.  The  dispersion 
error  in  the  form  of  a  wake  behind  the  cone  for  the  moving  mesh 

solution  is  reduced  significantly  from  the  wake  in  the  solution  using 

the  Lax-Wendroff  finite  difference  scheme  on  a  20x20  uniform  stationary 
mesh.  For  comparison  the  contour  plot  and  surface  plot  of  the 

Lax-Wendroff  solution  at  the  same  total  time  t  =  3.2  and  same  number 
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Figure  9.  Mesh  for  Example  4.1  at  t  =  1.6. 


Figure  10.  Mesh  for  Example  4.1  at  t  =  3.2. 


of  time  stepr.  are  shown  in  Figures  13  and  14.  Figure  15  compares  the 
path  of  the  center  of  mass  propagation  usirg  Equation  (2.2)  and  the 
real  characteristic  path  of  tne  peak  of  the  cone.  As  expected  for  this 
scalar  hyperbolic  problem,  the  vectors  for  the  movement  of  the  center 
of  error  mass  determined  by  Equation  (2.2)  closely  approximate  the 
characteristic  vectors  of  the  center  of  the  cone  with  a  maximum 
difference  of  15  percent  in  length  and  direction. 


X 


Figure  15.  Comparison  of  characteristic  path  of  center  of  cone  and 
path  of  center  of  error  mass  as  determined  by  Equation  (2.2)  for 

Example  4.1. 


Example  4.2  This  problem  is  a  scalar  double  rotating  cone 

problem  with  two  symmetric  cones  rotating  counterclockwise  around  the 
origin.  The  problem  is  given  by  Equations  (4.1),  (4.3),  and  new 

initial  conditions  provided  by 

1-16((x-1/2)2*1 .5y“),  if  (x-l/2)2+l .5y2  <  1/16 

u(x,y,0)  =-  1-16((x+1/2)2*1.5y2),  if  (x*l/2)2*1 .5y2  <  1/16  (4.6) 

0,  otherwise 

• 

Figure  16  shows  the  mesh  at  t  =  1.15.  Figure  16  shows  the  poor 

aspect  ratio  and  mesh  distortion  caused  by  the  rotation.  The  mesh 

tangles  as  the  cones  rotate  further.  When  mesh  tangling  occurs  a 
static  rezone  that  creates  a  new  mesh  using  an  algorithm  similar  to  the 
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one  tnst  generated  the  initial  mesh  must  oe  employed.  The  data  for 
the  mesh  can  to  be  obtained  by  interpolation  from  the  calculated 
solution  at  the  nodes. 
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Figure  16.  Distorted  Mesh  of  Example  4.2  at  t  =  l.l5. 


Example  4.3  This  problem  is  an  uncoupled  system  of  moving 
cones  that  pass  through  one  another.  This  causes  the  error  clusters  to 
collide  and  merge,  and  then  later  separate.  The  problem  is  given  in 
Equations  (4.7),  (4.8)  and  (4.9). 


u.  ♦  u  =  0 

t  x 

v.  -  v  =0 

t  x 


v(x,y,0)  = 


u(x,y,0) 


f M6((x*l/2)2*1.5y2),  if  (x*l/2)2*! .5y2<1/16 


K 


otherwise 

o((x-l/2)2M.5v2),  if  (x-l/2)2*1.5y2<1/16 
otherwise 


u(x,y,t)  =  v(x,y,t)  =  0  on  ail  boundaries  of  the  domain 


(4.7a) 

(4.7b) 


(4.8a) 


(4.8b) 


(4.9) 
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Figure  17.  Mesh  of  Example  4.3  at  t  =  0.35,  Clusters  have  merged  into 
single  cluster  centered  at  the  Origin. 

Figure  3  showed  the  initial  mesh  for  this  problem,  and  Figure  17 
shows  the  mesh  at  t  =  0.35,  just  as  the  clusters  have  collided  and 
merged.  From  t  =  0.35  to  t  =  0.9,  the  single  cluster  stays  centered  at 
the  origin  so  the  mesh  does  not  move  during  this  time.  At  t  =  0.9  the 
cones  have  passed  completely  through  one  another,  and  Figure  18 
shows  the  separation  of  the  error  clusters  and  the  movement  of  the 
mesh  toward  the  boundaries,  figure  19  shows  the  mesh  at  t  =  1.3. 
The  cones  and  error  clusters  have  reached  the  domain  boundary  and 
no  further  movement  of  the  mesh  will  take  place  as  the  cones  exit  the 
domain. 


5.  DISCUSSION  AND  CONCLUSIONS.  We  have  described  a  general 
two  dimensional  mesh  moving  technique  based  on  the  nodes  following 
error  propagation  that  is  determined  from  the  movement  of  dusters 
nodes  with  significantly  high  yrror.  This  mesh  moving  was  tested  on 
linear  hyperbolic  problems  having  solutions  with  large  giadients.  Even 
though  mesh  moving  in  two  dimensions  is  dirficuit,  we  are  encouraged 
by  these  initial  results.  The  mesh  moving  algorithm  was  able  to  control 
the  error  rotation  of  the  rotating  cone  in  Example  4.1  and  the  merging 
and  separating  of  error  regions  in  Example  4.3.  The  distortion  of  the 
mesh  in  Example  4.2  showed  the  need  for  static  rezoning  when  such 
distortions  occur. 

We  are  investigating  ways  to  improve  the  efficiency,  reliability, 
and  robustness  of  the  algorithm.  Possible  improvements  include:  not 
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Figure  18.  Mesh  of  Example  4.3  at  t  =  0.9,  Clusters  are  separating 
J  and  moving  coward  the  Domain  Boundaries. 
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Figure  19.  Mesh  of  Example  4  3  whet.  Clusters  reach  the  Domain 

Boundaries. 
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clustering  ot  every  time  step  and  letting  the  mesh  move  at  a  constant 
velocity  *or  several  time  steps,  efficiently  testing  for  mesh  tangleing  or 
distortion,  using  a  better  solver  for  hyperbolic  equations  such  as  the 
monotonic  schemes  of  Osher  [22],  vanLeer  [27],  or  Engquist  [12],  and 
using  better  error  estimates.  We  intend  to  show  the  flexibility  of  the 
mesh  mover  by  implementing  it  with  a  finite  element  solver  for 
parabolic  oroblems. 

Finally,  we  intend  to  implement  local  mesh  refinement  in  the 
algorithm.  We  believe  that  with  such  a  combination  algorithm  efficiency, 
accuracy,  and  robustness  can  be  increased. 
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Numerical  Simulation  of  Fluid  Ejection 


from  a  Spinning  Cylinder 
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Abstract:  A  computer  code,  based  on  a  convective  flux  approximation  on  a 
finite  difference  Eulerian  grid,  was  used  to  model  the  rate  of  fluid  ejection 
from  the  opened  end  of  an  azimuthally  rotating  cylinder.  The  computer  code. 
SOLA-VGF/CSL  is  described  in  a  previous  report  (1).  A  constantly  rotating 
cylinder,  with  an  80^  fluid  fill,  is  set  in  equilibrium  solid  body  rotation 
and  one  end  is  instantaneously  removed,  allowing  the  centrifugal  force  to  drive 
the  fluid  from  the  opened  end.  Fluid  parameters  have  been  chosen  to  model  the 
behavior  of  water  and  glycerin  at  ,!5°C.  The  ratio  of  the  volume  ejection  rate 
to  the  volume  rotation  rate  shows  similarity  when  expressed  as  a  function  of 
the  cylinder  rotation  time.  The  fluid  viscosity  is  observed  to  have  negligible 
effect  on  the  ejection  rate  for  the  spin  rates  of  interest.  This  behavior  is 
shown  to  be  consistent  with  a  dimensional  analysis  of  the  flows  considered. 

I.  Introduction:  The  centrifugal  force  provides  a  mechanism  for  the 
distribution  of  a  liquid  from  the  opened  end  of  a  rapidly  rotating  cylinder. 

In  1974,  Stuempfle  originally  studied  the  centrifugal  distribution  process,  and 
developed  an  analytic  model  (2).  To  further  examine  such  a  distribution  process, 
we  have  used  a  computer  code,  S0LA-V0F/CSL  (1),  to  generate  solutions  to  the 
flow  problem.  In  the  simulation,  a  cylinder,  partially  filled  with  fluid,  is 
initially  set  in  solid-body  rotation.  The  centrifugal  force  holds  the  fluid 
against  the  wall  of  the  cylinder  and  establishes  a  static  pressure  distribution 
which  increases  as  tne  radius  squared.  To  start  the  distribution  process,  one 
end  of  the  cylinder  is  instantaneously  removed  and  the  pressure  along  the  exposed 
surface  of  the  fluid  is  set  to  zero.  The  pressure  due  to  the  centrigugal  force 
then  drives  the  fluid  from  the  end  of  the  cylinder.  The  resulting  flow  is  some¬ 
what  analogus  to  the  flow  which  follows  the  collapse  of  a  dam. 

The  form  of  the  flu'd  distribution  and  the  time  scale  of  the  distribution 
process  are  the  main  concerns  of  this  investigation.  These  are  examined  most 
conveniently  by  considering  the  rate  of  volume  flow  out  of  the  cylinder  as  a 
function  of  time.  Our  objectives  are  to  characterize  the  fluid  distribution  as 
a  function  of  the  kinematic  viscosity  of  the  fluid  and  the  spin  rate  of  the 
cyl inder. 

H.  Flow  Model. 

A.  Initial  Conditions. 

Figure  1  shows  the  conditions  at  the  time  t  =  0.  Due  to  axial  symmetry 
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it  is  necessary  to  display  only  half  of  the  cylinder.  The  cylinder,  which  has 
a  length  L  =  33  cm  and  a  radius  R  =  10  cm,  is  spinning  with  angular 
frequency,  q  ,  about  the  vertical  axis  on  the  left  of  the  figure.  The  fluid 
is  moving  only  in  the  azimuthal  direction.  The  pressure  is  taken  to  be  zero 
in  the  void  surrounding  the  fluid.  Since  the  gravitational  force  is  much 
smaller  than  the  centrifugal  force  in  the  range  of  variables  considered,  the 
gravitational  force  is  neglected.  In  calculating  the  rate  of  volume  flow  out 
of  the  cylinder,  fluid  is  considered  to  have  left  the  cylinder  when  it  passes 
beyond  the  outer  radius  of  the  cylinder  wall. 


B.  Flow  Equations. 

The  equations  which  govern  the  flow  are  the  continuity  equation 
expressing  conservation  of  mass  and  the  Navier-Stokes  equation  expressinq 
conservation  of  momentum.  The  fluid  and  the  flow  are  considered  to  be 
incoinpressable.  The  equations  are 
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We  have  labeled  the  radial,  axial  and  azimuthal  coordinates  as  r,  z  and  9 
respectively,  and  the  respective  velocity  components  are  u,  v  and  w.  The 
kinematic  viscosity,  density  and  pressure  of  the  fluid  are  v  ,  p  and  p,  and 
the  time  is  labeled  t.  We  have  assumed  no  azimuthal  dependence.  These 
equations  are  solved  to  give  the  flow  of  the  fluid  from  the  cylinder.  The 
velocity  components  are  numerically  determined  using  an  explicit  finite 
difference  method  applied  on  a  finite  Eulerian  mesh.  The  fluid  pressure  is 
determined  implicitly,  using  an  iterative  scheme.  Details  of  the  solution 
precedure  are  given  by  Nichols,  Hirt  and  Hotchkiss  (3). 


To  track  the  free  surface  of  the  fluid,  a  fractional  volume  of  fluid 
function  is  used.  This  function  is  denoted  F  and  is  specified  by  the  equation 

If- +  7  l?(rFu)  +  ll(Fv)  -  0  <*> 

The  quantity,  F,  is  carrier  with  the  fluid  and  the  value  of  F  in  any 
particular  mesh  cell  range?  tvt-.’  to  0.  A  value  of  un!4y  corresponds  to  a 
mesh  cell  which  is  entirely  1  'leu  with  fluid,  while  a  value  of  zero 
corresponds  to  a  mesh  cell  which  is  entirely  empty.  Mesh  cells  with 
intermediate  values  of  F  contai'  f>ee  surface.  The  Donor-Acceptor  flux 
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approximation  is  used  to  calculate  sequential  values  of  F  in  the  mesh  cells. 
When  the  values  of  F  are  determined,  free  surface  is  extrapolated  through  the 
regions  where  0  <  F  <  1.  This  scheme  is  illustrated  in  Figure  2.  In 
regions  where  F  =  0,  the  pressure  is  taken  to  be  zero. 

III.  Results: 


A.  Flow  Pattern. 

The  flow  pattern  observed  in  the  calculations  is  shown  in  Figure  3.  Each 
computer-generated  frame  depicts  the  fluid  configuration  at  the  specified 
time.  In  the  calculation  shown,  the  fluid  density  was  1  gr  cm  and  the 
kinematic  viscosity  was  5  stokes.  The  anqular  rotation  rate  of  the  cylinder 
was  2.89  x  10^  sec  .  Line  segments  indicate  the  relative  magnitude  and 
direction  of  the  local  radial  and  axial  velocity  components  in  each  mesh  cell 
within  the  fluid  region.  The  free  surface  is  illustrated  with  a  solid  line. 

B.  Ejection  Rate  Analysis 

1.  Viscosity  Effects:  The  effect  of  the  fluid  viscosity  was  - 
examined  by  comparing  the  ejection  rate  for  fluids  of  density  1  gr  cm  and 
kinematic  viscosities  in  the  range  0?009  to.20  stokes.  The  cylinder  rjtation 
rate  remained  constant  at  5.78  x  10  L  sec  “  .  The  various  fluid  ejection 
rates  are  plotted  as  a  function  of  time  in  Figure  4.  The  ejection  rates  show 
an  initial  peak  at  2  x  10-,i  sec,  followed  by  a  somewhat  fluctuating  slower 
rise  to  about  1.2  x  10-<f  sec,  after  which  the  rate  decays  monotonically.  Half 
of  the  fluid  originally  in  the  cylinder  is  ejected  at  t  =  1.75  x  10“^  sec. 

The  fluctuations  in  the  ejection  rate  are  due  to  the  propogation  of  waves  on 
the  free  surface  of  the  fluid.  The  display  of  the  differential  quantity,  the 
rate  of  fluid  ejection,  magnifies  the  observed  effects  of  the  surface 
disturbances. 

No  significant  differences  are  observed  between  the  ejection  rates 
calculated  with  the  various  values  of  kinematic  viscosity.  This  result  can  be 
interpreted  in  terms  of  the  ratio  of  the  centrifugal  force  to  the  viscous 
force,  in  a  manner  similar  to  the  use  of  the  Reynolds  number  with  inertial 
forces.  The  appropriate  ratio,  C,  is 


where  R  is  the  cylinder  radius,  fi  is  the  angular  rotation  rate  and  v  is 
the  kinematic  viscosity  of  the  fluid.  The  range  of  values  of  the 
centrifugal  ./viscous  force  ratio  for  the  calculations  shown  in  Figure  4  is  2.9 
x  l(r  to  6. A  x  10.  In  these  cases,  the  fluid  ejection  process  is  dominated 
by  the  centrifugal  force,  and  the  viscous  forces  play  no  significant  role. 

To  further  examine  the  characterization  of  the  ejection  rate  by  the.ratio 
C,  calculations  were  performed  with  C  =  1.2  x  10J  and  with  C  =  6.3  x  10  ...  To 
obtain  the  low  values  of  C,  the  spin  rate  was  decreased  to  6.3  x  101  sec  -1 
and  viscosity  values  of  5  and  100  stokes  were  used.  Results  of  these 
calculations  are  shown  in  Figure  5.  A  smaller  ejection  rate  is  observed  for 
the  case  where  C  =  6.3  x  10.  We  conclude  that  the  centrifugal/viscous  force 
ratio  is  useful  for  characterizing  the  ejection  rate.  When  the  value  of  C  is 
greater  than  2.9  x  10  ,  the  viscosity  of  the  fluid  plays  no  significant  part 
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in  the  ejection  process.  Extensive  calculations  at  low  values  of  C  were  not 
performed  because  the  time  increment  limitations,  which  the  code  imposes  to 
control  the  rapid  momentum  transfer  at  high  viscosity,  caused  runs  to  execute 
for  rather  long,  costly  times,  and  viscous  effects  are  not  important  in  our 
region  of  interest. 

2.  Rotation  Rate  Effects:  To  examine  the  effect  of  rotation  rate, 

calculations  were  performed  for  a  fluid  with  a  viscosity?of  5  stokes  and  a  . 

density  of  1  gr  cm  .  Angular  spin  rates  of  2.89  x  10  L  to  1.16  x  lCr  sec"1 
were  selected.  The  calculated  ejection  rates  are  shown  in  Figure  6.  The 
rates  show  a  definite  dependence  on  the  spin  rate.  For  calculations  with 
various  spin  rates,  the  time  at  which  half  of  the  fluid  is  ejected,  T  is 

plotted  as  a  function  of  the  inverse  spin  rate  in  Figure  7.  Figure  7 

demonstrates  that  the  time  scale  of  the  ejection  process  is  inversely 
proportional  to  the  spin  rate. 

In  Figure  8,  the  ejection  rate  for  the  calculations  shown  in  Figure  6, 
i'ave  been  non-dimensional ized  by  dividing  by  the  volume  rotation  rate,  QR, 
g’ven  by  > 

Qr  =  LR2«  (7) 

and  the  nondimensional  values  have  been  plotted  as  a  function  of  the  angle  of 
rotation,  8,  given  by 


8  -  nt  (8) 

Before  nondimensionalization,  the  ejection  rates  were  averaged  over  a  time 
interval  of  2  x  10  ~'3  sec  to  reduce  the  fluctuation  level.  To  within  the 
limits  of  the  smoothing  process  (indicated  on  Figure  8),  the  non- 
dimensionalization  process  reduces  the  ejection  rates  to  the  same  curve. 

Figure  8  shows  that  the  magnitude  and  time  dependence  of  the  ejection  rate  is 
characterized  linearly  by  the  spin  rate.  The  ejection  distribution  and  time 
scale  can  be  determined  from  the  soin  rate  and  the  functional  form  displayed 
in  Figure  8. 

IV.  Summary  and  Conclusions: 

The  fluid  dynamics  simulation  code,  S0LA-V0F/CSL  has  been  used  to 
investigate  aspects  of  the  rate  of  fluid  ejection  from  the  opened  end  of- an 
azimuthal ly  rotating  cylinder.  At  angular  spin  rates  of  5.78  x  l(r  sec"1, 
fluid  viscosities  from  0.009  to  20  stokes  showed  no  significant  effect  on  the 
ejection  rate.  This  behavior  is  consistent  with  the  magnitude  of  the  ratio  o^ 
the  centrifugal  force  to  the  viscous  force.  As  the  ratio  becomes  smaller, 
some  viscous  effects  can  be  observed  with  the  simulation  code.  The  time 
dependence  and  magnitude  of  the  ejection  rate  scales  linearly  with  the  spin 
rate  of  the  cylinder. 
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Figure  1.  Initial  set  up  of  the  flow  problem  showing 
(a)  the  Euler>an  mesh  and  (b)  the  fluid  configuration 
at  time,  t  =  0.  The  radial  and  axial  velocity  components 
are  intitially  zero  in  all  inesh  cells.  The  cylinder 
is  33  cm  high  and  10  cm  in  radius.  The  mesh  is  40  cm 
high  and  10.5  cm  in  radius. 
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Figure  8.  Nondimensional  ejection  rate  for  a 
5.0  stokes  fluid  at  various  spin  rates  vs  the 
rotation  angle. 
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A  Numerical  Algorithm  for  the  Multidimensional, 
Multiphase,  Viscous  Equations  of  Interior  Ballistics 


James  A.  Schmitt* 

Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground MD  21005 


ABSTRACT 

A  numerical  method  based  on  a  linearized  ADI  (Alternating  Direction  Implicit)  scheme  is 
described  in  the  context  of  the  solution  procedure  for  the  nonlinear  partial  differential  equations 
associated  with  an  average  two-phase  (gas-solid),  two-dimensional,  fully  viscous  model  of  interior 
ballistics.  This  method  was  chosen  because  the  linearization  of  the  time-differenced  equations 
within  the  temporal  truncation  error  permits  a  non-iterative  solution  pror  Jure  for  this  implicit 
scheme,  and  the  spotting  of  difference  equations  along  the  coordinate  dir  vns  provides  a  block 
tridiagonal  structure  of  the  solution  matrices.  The  implementation  of  the  algorithm  possesses 
several  novel  features:  the  algorithm  is  derived  in  the  context  of  a  moving  coordinate  system;  the 
non-conservational  form  of  the  governing  equations  avoids  both  mass  sources  (which  can  be 
generated  by  grid  motion)  and  singular  solution  matrices  (which  can  arise  in  regions  of  one-phase 
flow  in  a  two-phase  calculation);  and  Anally,  the  Jacobian  type  matrices  (which  arise  from  the 
linearization  process)  are  determined  by  numerical  differentiation  instead  of  the  usual  analytic 
calculations.  This  numerical  scheme  is  encoded  in  the  DELTA  computer  code. 

Verification  of  the  DELTA  algorithm  is  obtained  by  comparing  simulations  to  an  analytic 
solution  of  an  isentropic  core  flow  and  to  the  two-dimensionsJ  results  of  aa  experiment. 
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1.  INTRODUCTION 


The  flowing  medium  in  a  gun  tube  typically  is  a  mixture  of  a  compressible  gas  and 
burning  solid  propellant  grains.  Details  of  the  flow  are  important  for  weapons 
development,  but  only  bulk  properties  can  be  routinely  measured,  such  as  the  trajectory 
of  the  projectile,  the  pressure  history  at  a  fixed  station,  the  heating  inside  the  gun  tube, 
etc.  Therefore,  a  need  exists  for  a  detailed  mathematical  model  of  interior  ballistics 
two-phase  flows,  and  an  algorithm  to  soive  the  corresponding  equations. 

The  three-dimensional  mathematical  model  is  developed  carefully  in  Reference  [l]. 
References  [2]  -and  [3]  are  shorter  versions  of  Reference  [l].  This  two-phase  model  is 
based  on  instantaneous,  finite  volume,  weighted  averaging,  and  consists  of  nonlinear 
partial  differential  equations,  constitutive  laws  for  the  averaged  variables,  and 
correlations  for  the  interphase  terms.  The  transient  phenomena  included  in  this  model 
are:  the  convection  of  the  phases  driven  by  gas  phase  pressure,  gas  phase  viscous 
stresses,  turbulence,  intergranular  stresses,  interphase  drag,  and  interphase  mass  transfer 
due  to  burning  of  the  solid  grains;  the  change  of  energy  in  the  gas  due  to  convection, 
pressure,  laminar  and  turbulence  dissipation,  conduction,  and  interphase  heat  transfer; 
and  the  change  of  the  geometry  and  number  of  the  burning  grains.  These  phenomena 
occur  within  the  volume  defined  by  the  gun  tube  and  the  base  of  an  accelerating 
projectile.  The  motion  of  the  projectile  and  the  two-phase  flow  field  are  coupled  via  the 
gas  pressure  exerted  on  the  projectile’s  base. 

This  model  is  specialized  to  the  case  of  axial  symmetry  of  the  flow  within  the  tube. 
Appendicies  A  and  B,  which  are  reproduced  from  Reference  [!],  list  all  the  differential 
equations,  constitutive  laws  and  correlations.  This  axial  symmetric  model  and  numerical 
scheme  are  encoded  in  the  DELTA  computer  code.  The  purpose  of  this  paper  is  to 
describe  in  some  detail  the  numerical  algorithm  (Section  2),  and  to  present  some 
validating  computer  runs  of  the  combined  model  and  algorithm  (Section  3). 

Previous  multi-dimensional,  multi-phase  work  applied  to  interior  ballistics  includes 
that  from  Paul  Gough  Associates,  Inc.  References  [4-5],  and  Scientific  Research 
Associates,  Inc.  References  [6-7].  Gough's  work  addresses  the  inviscid  flow  during  the 
ballistic  cycle,  i.e.  the  average  gas  phase  viscous  stresses,  heat  conduction,  and 
turbulence  are  excluded.  Thus,  the  phenomenology  of  pressure  waves  inside  a  gun  tube 
is  modelled  primarily  in  Gough's  work.  The  people  at  Scientific  Research  Associates 
considered  the  viscous  flow  phenomena,  i.e.  the  development  of  boundary  layers  inside  a 
gun  tube.  The  work  reported  here  is  more  similar  to  that  of  Scientific  Research 
Associates,  but  deviates  from  it  in  the  model,  i.e.  the  equations,  choice  of  dependent 
variable,  some  correlations,  and  in  alterations  in  the  numerical  algorithm.  Differences 
and  similarities  in  the  model  are  addressed  in  detail  in  Reference  [l],  and  in  tne 
algorithm  in  Section  2  of  this  paper. 


2.  ALGORITHM 


2.1  Governing  Equations 

The  governing  equations  listed  in  Appendices  A  and  B  are  a  set  of  nonlinear  partial 
differential  equations  which  are  first  order  in  time  and  second  order  in  the  two  spatial 
coordinates.  The  rationale  for  the  specific  form  of  the  equations  is  giv^n  in  Reference  [ij. 
The  general  form  is 

y<  =  G(r,z,f,  y,yny:,yrr,yr:,y::),  (2.1) 

where  the  independent  variables  of  time,  radial  coordinate  and  axial  coordinate  are 
denoted  by  t ,  r,  z,  respectively.  The  vector  of  dependent  variables  is  denoted  by  y,  and 
the  partial  derivatives  of  y  with  respect  to  the  spatial  and  temporal  coordinates  are 
denoted  by  subscripts.  The  components  of  the  vector  y  can  be  the  radial,  circumferential 
(swirl),  axial  components  of  the  gas  phase  velocity  ( u,v,w ).  respectively;  the  radizJ, 
circumferential  (swirl),  axial  components  of  the  solid  phase  velocity  (u*,v*,wr), 
respectively;  the  gas  phase  specific  entropy  s  ;  the  logarithm  of  gas  phase  pressure  q  ; 
the  regression  distance  of  the  solid  phase  d*  ;  the  number  of  particles  m *  ;  the  particle 
surface  temperature  T‘  ;  and  two  variables  to  define  the  turbulence  in  the  Sow  field. 
Thus,  depending  on  the  simulation,  the  number  of  dependent  variables  change  from  a 
minumum  of  four  tc  a  maximum  of  thirteen.  For  the  case  of  an  one-phase,  laminar  Sow 
simulation  with  swirl,  the  dependent  vector  y  has  five  components,  «,  v,  w,  8  and  q.  For 
the  case  of  a  two-phase,  turbulent  flow  simulation  with  ignition  and  burning  of  ihe  solid 
phase,  the  dependent  vector  y  has  nine  components,  u,  w,  u*  w*,  8,  7,  d *,  m*  T*.  This 
assumes  the  absense  of  swirl  and  an  algebraic  turbulence  mode!  (i.e.  the  turbulence 
properties  are  described  only  by  algebraic  relations).  The  components  of  the  vector  G 
are  nonlinear  functions  which  can  depend  on  the  variables  r,  z ,  t,  y,  y,  yv  y„,  yTZ,  y,z. 

The  spatial  domain  is  a  confined  volume  within  a  tube  bounded  in  length  by  a 
stationary  wall  (the  gun  breech)  and  the  base  of  an  accelerating  projectile.  The  radius  of 
the  tube  can  depend  ou  the  axial  position  from  the  breech  whicu  is  denoted  by  Zq.  The 
radial  coordinate  varies  from  the  axis  of  symmetry  to  the  tube  * ali.  The  radial  positions 
of  the  axis  and  wall  are  denoted  by  rA  and  rtf(. ),  respectively.  The  axial  coordinate 
varies  from  the  breech  to  the  base  of  the  projectile.  The  axial  position  of  the  breech  and 
projectile  base  may  have  a  radial  dependence  which  is  denoted  by  Zg(r)  and  zp(r), 
respectively. 

The  projectile  is  assumed  to  move  as  a  rigid  body.  The  unsteady  projectile  motion  is 
governed  by  the  following  equations: 
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where  u'p.rp.mp  denote  the  axial  velocity,  circumferential  velocity  and  mass  of  the 
projectile,  respectively.  The  forces  that  retard  the  motion  of  the  projectile  are  those  due 
to  air  resistance,  friction  between  the  projectile  and  tube  wall,  and  gas  leakage  around 
the  projectile,  and  are  denoted  by  F^,  and  F^,  respectively.  These  retarding  forces 
are  assumed  to  be  kuown  functions.  If  the  tube  is  rifled,  an  additional  phenomena  is 
present  which  causes  the  projectile  to  rotate,  and  its  mass  to  be  effectively  increased  via 
equation  (2.4).  In  this  case  the  angle  of  rifling  Or  is  nonzero,  and  the  moment  of  inertia 
of  the  projectile  4  must  be  given.  Because  the  pressure  p  is  determined  from  the 
solution  of  governing  equations  of  the  flow  field,  which  depends  on  the  value  of  ti>_, 
equations  (2.1)  -  (2.5)  represent  a  coupled  system  with  a  moving  boundary. 


2.2  Numerical  Algorithm 


We  want  to  compute  by  finite  difference  approximations  the  transient  values  of  the 
variables  which  describe  the  fluid  dynamics  of  the  flow  in  the  region  confined  by  the 
inner  tube  wall,  breech,  and  moving  projectile  One  way  to  calculate  in  this  expanding 
computational  region  is  by  an  “accordion”  type  grid  in  the  axial  direction,  i.e.  the  first 
and  last  axial  grid  points  are  attached  to  the  breech  and  projectile,  respectively,  and  the 
mesh  expands  as  the  projectile  accelerates  down  the  tube.  Thus,  the  physical  grid  moves 
in  accordance  with  the  projectile  motion.  With  regard  to  the  spatial  finite  difference 
approximation,  the  goal  is  to  obtain  an  accurate  approximation  to  the  actual  physical 
happening.  It  can  be  shown  that  the  finite  difference  approximations  to  the  physical 
variables  in  the  physical  mesh  are  the  same  whether  one  directly  differences  on  the 
physical  grid,  or  one  differences  on  a  transformed  grid  and  then  transforms  back  to  the 
physical  grid.  Higher  accuracy  in  a  transformed  space  is  not  meaningful  if  it  is  lost  in 
the  transformation  back  to  the  physical  space.  Furthermore,  our  physical  grids  will  be 
orthogonal  or  nearly  orthogonal.  Thus,  we  choose  to  compute  finite  difference 
approximations  to  spatial  derivatives  on  the  hysical  grid.  An  additional  advantage  of 
our  method  is  that  the  governing  equations  aeed  not  be  transformed,  and  thus  are 
simpler  to  understand  and  change  in  the  compute  code. 

The  finite  differencing  of  the  time  derivatives  can  be  of  two  generic  types:  implicit  or 
explicit  (See  Reference  [8j).  The  principal  advantage  of  an  implicit  scheme  is  its  superior 
stability  properties  compared  to  an  explicit  scheme.  For  conversion-diffusion  type 
problems  iike  those  given  by  equation  (2.1),  an  explicit  finite  difference  method  has  two 
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stability  conditions,  the  Courant-Friedriehs-Lewy  condition  and  the  viscous  stability 
limits  In  one  dimension,  these  condition  are: 


At  .-=  CFL 


Ax 
c  +  tl) 


(2.6) 


At  =  VSL  -^-(At)2  , 

2/i 


where  Ax  is  the  spatial  mesh  increment,  c  is  the  sound  speed,  iv  is  the  gas  velocity,  [i  is 
the  viscosity  and  p  is  the  density.  The  constants  CFL  and  VSL  are  less  than  or  equal  to 
one.  For  simulations  which  involve  boundary  layers,  small  grid  sizes  are  necessary. 
Thus,  for  this  type  of  simulation,  the  time  step  (the  size  of  At)  must  be  proportional  to 
the  square  of  the  smallest  grid  increment  for  an  explicit  scheme.  On  the  other  hand, 
most  implicit  schemes  have  no  corresponding  stability  conditions,  and  significantly  larger 
time  steps  based  on  accuracy  considerations  rather  than  stability  can  be  used.  The  basic 
disadvantage  of  implicit  algorithms  is  that  they  tend  to  be  more  complicated  than 
explicit  schemes,  and  thus  more  difficult  to  understand  and  implement.  In  particular, 
applying  a  standard  implicit  scheme  to  system  of  equation  (2.1),  we  obtain  a  nonlinear 
system  of  algebraic  equations  in  the  variables  at  the  new  time  level.  This  system  can  be 
quite  large  and  complex  because  it  possesses  an  equation  for  each  dependent  variable  and 
for  each  grid  point  in  the  two-dimensional  computational  mesh.  These  equations  are 
coupled  via  the  spatial  derivatives.  Iterative  methods  are  the  most  common  solution 
procedure,  but  they  can  be  quite  complex  and  time-consum*ng  for  such  a  general  system. 
To  mitigate  these  undesirable  characteristics,  we  apply  a  method  developed  in 
References  [9-11].  A  salient  feature  of  this  method  is  the  temporal  linearization  of  the 
nonlinear  terms  to  within  the  local  truncation  error  of  the  finite  difference  approximation 
of  the  time  derivative.  The  resulting  system  can  then  be  represented  in  a  matrix 
equation. 


Ay" 


(2.7) 
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where  y"  is  the  vector  of  unknown  dependent  variables  at  the  new  time  level,  and  A 
and  b  are  the  matrix  and  vector  of  values  at  the  known  time  level,  respectively. 
Furthermore,  the  matrix  A  can  be  structured  if  we  decompose  the  time-differenced 
equation  of  (2.1)  into  two  systems  of  equations,  each  of  which  involves  the  spatial 
derivatives  of  the  unknown  variable  in  only  one  coordinate  direction.  This 
decomposition  or  splitting  is  done  so  that  the  error  incurred  is  of  the  order  of  the  local 
temporal  truncation  error,  and  so  t-bat  the  decomposed  or  split  equations  still  form  a 
consistent  approximation  to  equation  (2.1).  When  centered  differences  are  used  to 
approximate  the  spatial  derivatives,  tridiagonal  matrices  are  obtained.  Because  we  are 
dealing  with  a  system  cf  equations,  the  matrices  are  block  tridiagonal  where  the  size  of 
the  blocks  is  equal  to  the  number  of  dependent  variables.  This  method  of  splitting  the 
implicitly  differenced  equations  is  called  an  Alternating  Direction  Implicit  (ADI)  scheme 
(see  Reference  (8]).  Because  we  have  also  linearized  the  equations,  we  shall  refer  to  this 
method  as  a  linearized  ADI  scheme. 

This  general  linearized  ADI  scheme  is  applied  to  the  instantaneous,  finite-volume, 
weighted,  averaged  equations  of  interior  ballistics  with  several  unique  features:  The 
algorithm  is  derived  for  a  moving  coordinate  system,  and  has  no  mass  source  due  to  the 
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motion  of  the  grid.  The  elements  of  the  matrices  derived  by  the  linearization  process  are 
obtained  directly  by  numerical  differentiation  which  bypasses  the  tedious  and  error 
prone  task  of  analytically  deriving  each  element,  and  the  subsequent  coding  of  these 
complex  expressions.  The  spatial  differencing  is  performed  directly  on  nonuniform 
distributed  grid*. 


2.2.1  Algorithm  for  Non- Boundary,  Non-Ccuter-Linc  Points 

We  derive  the  numerical  scheme  for  the  system  of  equations  (2.1)  on  a  moving 
coordinate  system;  i.e.,  the  coordinates  of  the  spatial  grid  system  varies  in  time.  We  let 
the  superscripts  n  denote  the  new  time  level  and  e  the  current  time  level.  The  change  in 
the  j th  coordinate  position  of  a  spatial  coordinate  x  from  level  e  to  n  is  denoted  by 

A*;  =  •-"-*/  =  (|f I  (<•-!*)  =  0(AI),  <'«'«•,  (2.8) 

where  A t  —  t*  -  te ,  A  Taylor  expansion  of  y*  =  yfr8,  z*,  t%)  about  the  current 
values  at  ( re ,  Z*,  tc)  is 

y*  =  ye  +  -^-Af  +  -“-Az  +  -^p*Ar  (2.9) 

dt  dz  or 

+2ArA^+^A’s+-S1H+0(A'S) 

By  adding  (1  -  /?)  times  (2.9)  and  /?  times  a  similar  expansion  to  (2.9)  of  ye  expanded 
about  ym,  and  noting  that  y*  -  ye  =  0( At),  z*  -  f  =  0(At),  and  I*  -  V  =  0(AI), 
we  obtain 

jr--/JA<G*-/JArj|*-j  -/)A*{|£-]  (2.10) 

=  r  +  (1-fflAl  G'  +  (1-fflAr  |  |£-)  +  (M)A*  ( |tj 
+  (/J  -  i)0(A<*)  +  A!*),  }</*<1.0, 

where  Ej{ At'3)  denotes  the  neglected  truncation  error  of  0(Af®).  For  a  stationary  grid 
Ar  =  A z  ==  0,  we  obtain  the  standard  Crank-Nicolson  scheme  for  integration 
parameter  f3  —  -i-  and  tbe  standard  fully  implicit  scheme  for  fi  =  1.0.  Equation  (2.10) 

if 
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is  a  nonlinear  system  of  equations  in  y*  because  G*  is  a  nonlinear  function.  To  make 
(2.10)  a  system  of  linear  equations,  we  linearize  G"  via  a  Taylor  expansion  about  the 
current  level,  i.e. 


G’  =  G‘+  ‘ai  +  BUAP) 

+  De  Ay  +  DRe  Ay  r  +  DZe  Ay  t 
+  DRRC  Ay  „  +  DRZe  Ay„  +  DZZeAy„ 

+  ELk^\ 


(2.11) 


where  E^A?)  denotes  the  neglected  linearization  error  of  0{At2).  The  Jacobian  type 

matrices  are  denoted  by  D,  DR,  DZ,  DRR,  DRZ,  DZZ,  are  evaluated  at  the  current 
time  level,  and  are  defined  as 
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(2.12) 


Upon  substituting  (2.11)  ,nto  (2.10),  we  obtain  the  following  linear  system  of  equations 
m  y  after  algebraic  Manipulation: 
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he  neglected  linearization  error  En  is  nor?  0{  Af3)  because  G*  was  multiplied  by  A/  in 
(2.10).  Thus,  the  linearization  process  does  not  alter  the  order  of  the  temporal  error. 

The  values  yc,  r*,  ze,  tc,  r*  2*,  I*  are  all  known  before  the  start  of  the  integration 
routine  to  determine  the  values  of  y*.  Thus,  the  right  hand  side  of  (2.13)  is  a  known 
vector.  The  left  hand  side  of  (2.13)  can  be  written  as  a  matrix  with  known  values  times 
a  vector  of  unknowns,  y*.  Thus,  (2.13)  has  the  form  of  (2.7).  The  matrix  A 
interconnects  values  of  yw  at  a  grid  point  to  all  the  values  of  its  spatial  neighbors  via  the 

[  aV 

first  and  second  spatial  partial  derivatives  at  level  n.  If  the  term  - 


drdz 


were  absent 


from  the  left  hand  side  of  (2.13),  we  could  decompose  (2.13)  into  two  matrix  equations 
which  are  highly  .structured  and  easily  solvable  while  still  retaining  only  two  time  levels 
of  the  dependent  variable  vector,  i.e.  y*  and  ye.  To  this  end,  we  linearize  the  mixed 
derivative  at  the  n  level  about  the  current  level  and  retain  only  the  leading  term: 
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(2.14) 


Substituting  (2.14)  into  (2.13),  we  obtain  after  some  algebraic  manipulates 

[i  -  W+^'ljy*  =  [i  -  «nt+X¥)]y'  +  ly' 


(2.15a) 


=  [0  -  l)0{Afi)  +  E-AA I3)  +  ELI(At')  +  Eu(At\ 


^-Ar  m  +Az  ra  +A,G‘ 


where  the  operators  are  defined  as  follows: 


(2.15b) 


(2.15c) 


(2.15d) 


The  symbol  I  represents  the  identity  matrix,  and  the  superscript  k  can  be  either  n  or 
c.  The  “lagging”  of  the  mixed  derivative  (2.14)  increased  the  error  of  the  approximation 
to  0{A<-)  for  any  value  of  the  integration  parameter  /?,  but  we  gain  a  structured 


matrix. 


We  can  decompose  (2.15a)  along  coordinate  directions  in  the  following  manner: 
[l -/?£>;]  y'=  [i  -  0Dp‘  +  Ly‘ , 

['  -  w]  yF  =  [i  -  0[\  y'  +  (y'  -  y‘). 


(2.10) 


(2.17) 


Equation  (2.10)  constitutes  the  radial  sweep  of  this  Alternating  Direction  Implicit 

method  because  it  involves  only  spatial  derivatives  in  the  radial  direction  at  the  new 

time  level.  One  solves  this  equation  for  each  fixed  axial  index  and  for  radial  indices 
varying  from  the  axis  of  symmetry  to  the  gun  tube  wall.  When  three  point  centered 
spatial  finite  differences  are  used  to  approximate  the  spatial  derivatives  in  the  radial 
direction,  the  matrix  X  -  is  a  block  tridiagonal  matrix.  The  size  of  each  block  is 
equal  to  the  number  of  dependent  variables.  The  number  of  block  rows  is  equal  to  the 
number  of  grid  points  from  the  axis  of  symmetry  to  the  gun  tube  wall,  denoted  by 

JRXfX.  The  block  rows  from  the  second  grid  points  to  one  away  from  the  wall,  namely 

JR. MX- 1  is  determined  by  (2.16).  The  entries  of  the  first  and  last  block  rows  are 
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determined  from  the  conditions  imposed  at  the  axis  t?  symmetry  and  wall,  respectively. 
The  right  hand  side  of  (2.16)  is  a  known  vector  because  it  is  evaluated  at  the  current 
time-step.  The  solution  of  this  equations  is  the  intermediate  values  of  the  dependent 
variables  y1. 

Equation  (2.17)  constitutes  the  axial  sweep  o?  this  two  sweep  scheme  because  it 
involves  only  the  spatial  derivatives  in  the  axial  direction  at  the  new  time  level.  One 
solves  ihis  equation  for  each  fixed  radial  index  and  for  axial  indices  varying  from,  the 
breech  to  the  base  of  the  projectile.  The  matrix  I  -  /9Z>*  is  a  block  tridiagonal  matrix 
when  three  point  centered  spatial  finite  differences  are  used  to  approximate  the  partial 
derivatives  in  the  axial  direction.  The  size  of  the  blocks  are  the  same  as  in  the  radial 
sweep,  and  the  number  of  block  rows  is  equal  to  the  number  grids  points  placed  from 
the  breech  to  the  projectile  base,  denoted  by  JZMX.  Equation  (2.17)  is  used  to 
determine  the  entries  in  the  block  rows  from  the  second  point  (one  after  the  breech)  to 
JZMX- 1  (one  before  the  projectile  base).  The  entries  in  the  first  and  last  block  rows  are 
determined  from  the  boundary  conditions  imposed  at  the  breech  and  projectile  base, 
respectively.  Because  the  intermediate  value  y‘  is  known,  the  right  hand  side  of  (2.17)  is 
known.  The  solution  of  (2.17)  is  the  value  of  the  dependent  variables  at  the  new  time 
level,  denoted  by  yF.  The  values  of  the  solution  vector  yF  of  (2.17)  and  these  of  the 
solution  vector  yn  of  (2.15)  differ  by  the  time  error  introduced  by  the  splitting  (2.16)- 
(2.17).  To  determine  the  order  of  this  error,  we  substitute  (2.17)  into  (2.16)  and  obtain 

(I  -  ffDr)[(l  -  pD,)yr -  (I  -  0D,)y‘  +  y<]  =  |I  -  0D,\y‘  +  Ly'  (2.18) 
which  simplies  to 

(i  -f>Dr-  = (i  -  /w,  -  mr  +  i r-  PDfiiyF  -r).  (2.19) 

Subtracting  (2.15a)  from  (2.19),  have 

(I  -  pD,  ■■  0D,)(yr  -  jr-)  =  y"  +  y"  -  y‘)  (2.20) 

or 

(I  -  PD,  -  SD,  +  pDM,)( yr  -  y ")  =•■  -pDfiiy'  -  y ').  (2.21 ) 

We  note  that  the  coefficient  of  -  y*  is  0(1),  Dr  and  Dx  are  each  0(Af)  and 
(y*  -  ye)  is  at  least  0(At).  Thus, 

yF  =  y*  +  E^AP),  (2.22) 

that  is,  yF  is  equal  to  y*  to  within  the  local  truncation  error  of  the  scheme. 

Equations  (2.16),  (2.17)  with  the  definitions  (2.15b)-(2.15d)  represent  the  time 
differenced,  linearized,  ADI  scheme.  We  now  turned  to  the  finite  difference 
approximation  oi  the  spatial  derivatives.  The  standard  centered  finite  difference 
approximations  to  the  spatial  partial  derivatives  in  the  coordinate  direction  x  at  the  i  ** 
grid  point  are 


(j£r.|  ~  Mr*-y,-- 1)  h<-i(y;+i  -  y.) 

1  dx\i  hf.iihi  +  At+1)  h{  (A,-  +  A,+  1)  ; 
pV)  ~  2(y-i  ~Yi)  ,  2(y,+1-y,) 

l  dx2 1  hUh{  +  Vi)  T  M*,+  Vi)’ 


(2.23) 


(2.24) 


where  A,-  =  xl+1  -  x,-.  However,  instead  of  (2.23)  we  use 

-=  (y«H  ~  y'-») 
dx  +  A,-.,) 


(2.25) 


(dee  Reference  12).  For  equr'ly  snaced  meshes  (2.23)  and  (2.25)  are  identical, 
nonuniform  spaced  grid,  the  difference  between  then  can  be  expressed  as 


2V  • 1  •'  d£ 


For  a 


(2.26) 


The  nonuniform  grids  that  are  used  in  our  application  have  the  property  that  A,_,  >  A,- 
so  that  (2.26)  acts  as  a  stabilizing  viscous  term  to  the  spatial  differencing. 

To  complete  the  description  of  the  interior  point  algorithm  we  discuss  the 
determination  of  the  Jacobian  type  matrices  D,  DR,  DZ ,  DRR,  DZZ  defined  by  (2.12). 
The  obvious  way  to  evaluate  the  elements  of  these  matrices  is  to  manually  take  the 
partial  derivatives  and  code  each  element.  This  double  procedure  is  very  error  prone 
because  the  right  hand  sides  of  the  equations  are  extremely  coir  dex.  If  there  are  I'-EQ 
variables,  then  each  element  of  the  vector  G  is  a  function  of  5*U  1  +  3  arguments,  in 

general.  Further' '.ore,  for  two-phase  simulations,  the  correlations  aie  not  fixed,  but  can 
v?  •  substantially  from  simulation  to  simulation.  In  these  cases  new  elements  of  the 
matrices  would  have  to  be  determined  and  encoded.  To  avoid  this,  one  can  lag  the 
contribution  of  these  correlations  by  one  time  step.  This  is  not  totally  desirable  because 
one  increases  the  local  truncation  error  (see  the  discussion  near  (2.14))  which  can  be  large 
when  the  correlations  add  significantly  to  the  flow  dynamics  in  a  given  time  step,  e.g., 
burning  of  the  grains.  An  alternative  to  this  whole  procedure  is  to  determine  these 

matrices  numerically.  Consider  the  determination  of  D —  which  can  be 

}  dyj 

approximated  by 


Ay 


G,{r,Z,t,yu  •  •  •  ,yy  +  6  •  •  •  ,VNEQ>Yn 


.y«) 


(2.27) 


-  G,{r,z,t,yu  ■  ■  ,yr  6  -  ■  ■  ,yNEQ,%,  •  •  •  ,yjj  /(2 6) 


where  8  is  the  pre-determined  increment.  Once  these  increments  are  obtained,  the 
elements  of  the  matrices  can  be  computed  tri  .ally  by  repealed  calls  to  s  subroutine 
wbicn  computes  the  right  Land  sides  of  the  equations.  A  eharac .erist ic  of  the  G  ,  s  is 
that  the  terms  yr  and  y  appear  at  most  quadratically,  and  the  -  ’rms  yr  and  y::  appear 
hi  most  linearly.  Thus,  by  using  center^  differences  the  matrices  DR  and  DZ  can  be 
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obtained  exactly  for  any  value  of  the  increment.  Likewise,  using  one-side  differences  the 
matrices  DRR  and  Z)/?Z  can  be  determined  exactly.  In  these  cases,  a  large  value  of  the 
increment  can  he  used  to  avoid  any  round  off  errors.  On  the  other  hand  the  vector  G  is 
a  non-algebraic,  nonlinear  functio;.  of  the  vector  y  In  this  case  one  cannot  obtain  an 
exact  value  of  the  elements  of  D  for  any  value  of  the  increment.  We  have  developed  a 
strategy  to  compute  an  increment  value  based  on  the  current  error  estimates  of  G,-  and 
yy,  that  is,  to  determine  D;j  a  6--  is  ured.  -  The  drawback  of  thi3  approach  is  the 
computing  time  necessary  to  evaluate  the  right  hand  sides  as  often  as  required. 

Finally,  we  address  the  problem  of  artificial  mass  sources  induced  solely  by  the 
motion  of  a  grid  system.  The  problem  was  illustrated  ard  resolved  in  Reference  13. 
There  exist  a  standard  procedure  to  determine  if  mass  sources  occur  in  a  numerical 
scheme  when  the  grids  are  moved.  First  one  assumes  a  constant  flow  field  at  the  current 
level,  secondly  one  applies  the  method  to  compute  the  new  time  level  of  values  on  a 
displaced  grid,  and  finally  one  determines  if  these  new  values  differ  from  the  constant 
values.  Following  this  procedure,  we  assume  that  the  flow  field  variables  are  constants 
which  satisfy  t-ie  partial  differential  equations,  and  assumr  that  A r  5^  C  and  Az  jL  0. 
Consequently,  all  the  spatial  derivatives  at  the  current  time  level  arc  zero  and  (2.13) 
reduces  to 


y;’  -  /?|  Aryra  4-  Az  y* 


(2.28) 


+  A f[/)  y*  +  DR  jy*  +  DZy*  +  DRR  y*  +  DRZ  y'  +  Z?^y;“]| 
—  ye  -  ft  £i  D  ye. 


Using  the  fact  that  the  spatial  derivatives  at  the  current,  level  are  zero,  we  add  zero  to 
(2.28)  in  a  convenient  form  ‘0  obtain 

A(y"  -  ye)  =  0.  (2.29) 

where  A  is  a  matrix.  If  A  is  noasingular,  then  the  solution  of  f 2.29)  is  y*  =  yc.  Thus, 
no  mass  sources  exist.  The  l*ck  of  mass  source*  is  due  to  the  form  01  the  equations,  i.e 
yt  =  G,  and  the  algorithm.  For  a  simpler  sr.i  oi  equations  than  the  one  we  are  solving, 
•md  for  r  3et  in  a  conservation  form  which  are  transformed  to  a  stationary  uniform 
computational  grid,  a  “Geometric  Conservation  Law”  is  needed  to  prevent  mass  sources. 
(See  Reference  13).  Our  method  automatically  avoids  this  other  partial  differential 
equation,  and  thp  u,  ''d  to  obtain  its  solution  at  every  time  step. 


2.£  ?  Algorithm  for  Feint*  nt  the  Center-Line  and  at  the  Solid  Surfaces 


The  meihod  to  obtain  the  new  time  level  of  the  flow  variables  along  the  center-line  is 
different.  To  maintain  the  axial  symmetry  of  tic  flew,  the  physical  conditions  on  the 
flow  are  the  radial  and  circumferential  velocity  components  fo'  both  the  gas  and  particle 
phases  must  be  zero,  a  id  the  first  parrial  derii'atives  with  respect  t.o  the  radial  direction 
at  the  axis  of  symmetry  of  the  remaining  variables  mot  be  terz. 
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The  contribution  from  the  points  on  the  axis  of  symmetry  can  be  done  in  at  least 
two  ways.  The  first  and  simplest  is  to  directly  apply  the  symmetry  conditions.  For  a 
radial  sweep,  the  elements  of  the  first  block  row  of  the  matrix  A  and  known  vector  b 
are  the  finite  difference  approximations  of  these  conditions.  The  axial  sweep  along  the 
center-line  is  performed  after  the  axial  sweeps  along  interior  axial  indices.  The  final 
values  at  the  center-line  are  obtained  using  the  symmetry  conditions,  and  the  final  axial 
sweep  values  of  the  non-ceuter-line  points.  The  second  way  is  mc.\.  complex.  Because 
the  center-line  is  part  of  the  flow  Stic.  ^physically  a  non-boundary),  the  governing  partial 
differential  equations  are  valid  on  the  axis  of  symmetry.  One  may  rewrite  these 
equations  with  the  symmetry  conditions  imposed  in  the  equations  themselves,  and  with 
the  corrsct  limit  conditions  as  the  radial  coordinate  gees  to  zero.  Then,  solve  these  new 
equations  by  exactly  the  some  method  as  described  for  non-boundary  points.  Although 
both  are  coded,  the  ••■mpier  first  option  is  utilised. 

The  boundary  conditions  at  the  solid  surfaces  such  as  the  breed  'ube  wall  and 
projectile  base  can  vary  substantially  with  the  particular  simulation  Th;  makes  a 
genera!  discussion  of  boundary  conditions  difficult.  However,  we  v  d,  discuss  some 
implementations  of  common  types  of  boundary  conditions.  The  simple  functional  form 
boundary  condition  y  =  constant,  and  the  simple  derivative  boundary  condition 

•—  =  constant  are  the  most  trivial  kinds.  Their  finite  difference  approximations  are 
ox 

straightforward,  if  y  is  a  variable  computed  directly  by  a  governing  partial  differential 
equation,  i.e.,  one  of  the  variables  in  (2.1).  However,  if  one  b  is  a  nonlinear  boundary 

condition  of  the  form  /( y)  *  =  0  or  /|  ^~|  ~  ®  »  then  one  must  linearize  function  / 

in  time  in  the  same  manner  as  is  any  component  of  the  nonlinear  \  ~ctor  function  G  in 
(2.1)  and  (2.11). 

Sometimes  a  nonlinear  boundary  condition  can  be  reformulated  as  a  linear  one. 

(6  T\  * 

-^—1  =  0  were  T  is  the  temperature  and  n  is  the 

outward  normal.  In  our  method  the  entropy  s  and  pressure  function  q  are  computed 
directly  from  the  governing  partial  differential  equations.  Thus  T  is  a  nonlinear 

function,  T  —  ?\s,q).  We  ordinarily  would  expand  [  in  a  Tavlor  series  in  time  to 

l  tfnj 


obtain  a  linear  function  in  aB  and  q*.  However,  for  a  Noble-Abel  equation  of  state,  we 
have  a  simplification.  We  use  the  chain  rule  to  obtain 


87  _dj_  «T  dq  j 
ds  dn  Jq  dn  j 


•lil  1=0. 


(2.30) 


Since  0  ,  then  (2.30)  can  be  written  as 

os 


=  0. 


(2.31) 


dT 

da  dq  dq 
dn  +  dT  dn 
da 

By  noting  that  the  ratio  of  partial  derivatives  of  temperature  is  a  constant,  (2.31)  is  a 
linear  function  in  s"  and  qn  Thus,  (2.31)  can  be  finite  differenced  and  incorporated 
directly  into  the  matrices  of  tht  linearized  ADI  method. 


2.2.3  The  Order  of  ihe  Sweep t 


Tfe  o^der  of  the  sweeps  is  mainly  a  bookkeeping  problem,  and  should  not  have  a 
large  '•fieri  on  the  solution.  We  have  used  the  following  procedure.  First,  radial  sweeps 
from  tne  center-line  to  the  tube  wall  are  performed  along  constant  axial  indices  to  obtain 
intcii.-.  «iiate  values  of  the  variables  y  using  (2.16).  The  axial  index  varies  from  the  one 
after  ihe  breech  to  the  one  before  the  projectile  base.  Second,  axial  sweeps  from  the 
breech  to  the  projectile  base  are  performed  along  constant  radial  indices  to  obtain  values 
of  the  variables  y  at  the  new  time  Ie\Vi  using  (2.17).  The  radial  index  varies  from  the 
one  after  the  axis  of  symmetry  to  the  one  before  the  gun  tube  wall.  Third,  the 
symmetry  conditions  are  applied  to  determine  the  new  time  level  values  at  all  points  on 
the  axis  of  symmetry  by  using  the  resul's  of  the  axial  sweep  (step  two).  Fourth,  the 
value  of  the  variables  at  the  new  time  level  at  all  grid  points  on  the  wall  a.-e  determined 
by  imposing  the  wall  boundary  condition  using,  if  necessary,  the  resuits  of  the  axial 
sweep  (step  two). 

We  note  that  the  radial  sweeps  along  the  breech  and  projectile  are  avoided.  The 
justification  is  that  in  many  applications  the  boundary  conditions  at  the  breech  and 
projectile  do  not  involve  radial  derivatives.  Consequently,  the  linearized  version  of  these 
conditions  can  be  expressed  without  the  need  for  the  determination  of  the  intermediate 
vaiues  of  the  variables.  Recall  that  the  matrix  D  (equation  (2.12))  was  included  in  the 
radial  sweep.  (See  equations  (2.15b)  &  (2.16)).  However,  it  could  have  been 
incorporated  into  the  axial  sweep  formula.  (See  equations  (2.15c)  &  (2.17)).  If  no  radial 
derivatives  exist  for  the  boundary  conditions,  we  3void  the  radial  sweep  and  incorporate 
the  D  matrix  in  the  axial  sweep  formulation.  An  example  of  this  type  of  boundary 
condition  is  the  gas  continuity  equation  used  to  determine  the  transformed  pressure  q  at 
the  breech  for  a  one-phase,  viscous  simulation..  Imposing  the  no-slip  conditions 
ti  =  v  =  to  —  0  in  the  continuity  equation  evaluated  at  the  breech,  ail  radial 
derivatives  vanish,  and  the  above  method  is  applied  directly  ”jring  the  axial  sweep 
where  one-sided  finite  differences  rre  used  to  approximate  the  spatial  derivatives  at  the 
breech.  If  ona  use?  the  normal  velocity  equation  for  the  determination  of  q  at  the 
projectile  base,  for  example,  some  radial  derivatr/es  remain  in  the  viscous  terms.  Only 
by  lagging  them  could  one  use  the  above  approach. 
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a.  RESULTS 


The  results  of  two  simulations  using  the  DELTA  code  are  presented  in  this  section. 
These  particular  calculations  are  selected  because  they  can  be  compared  to 
independently  determined  answers,  and  thus,  give  some  verification  of  the  code’s 
accuracy  and  capabilities.  Both  eases  involve  a  one-phase  gas  expansion  in  a  con?'  cat 
cross-section  tube  closed  at  one  end  by  a  stationary  surface  called  the  breech,  and  at  the 
other  end  by  a  movable  piston  called  the  projectile.  The  b*eech  and  projectile  base  are 
assumed  to  be  flat  surfaces.  The  initial  states  of  the  bas  are  uniform  and  quiescent,  but 
the  gas  pressures  are  great  enough  to  accelerate  the  projectile  through  the  tube.  The 
controlling  mechanisms  of  .he  expansion  flows  are  the  same,  namely  the  propagation  of 
the  rarefaction  wave,  generated  by  the  projectile  displacement,  and  its  reflection  from 
the  breech,  then  the  projectile,  and  so  forth.  However,  the  gas  pressure  levels  differ 
greatly  between  the  simulations,  and  the  subsequent  flows  are  in  different  regimes. 

TABLE  1.  Geometry  and  Gas  Properties  for  the  150-mm  Gun 
and  Bicen-Whitelaw  E  ApCi!  meats 


150-mm  Gun 


Description 


Bicen-Whitelaw 


150.0 

1.698 

6.0 

50.0 

0.C01 

1.22 

621.09 

2666.8 


Bore  Diameter  (mm)  76.7 

Initial  Projectile  Displacement  (m)  0.1773 

Maximum  Travel  of  Projectile  (m)  0.3 

Projectile  Mass  (kg)  2.54 

Covolume  (m5/kg)  0.0 

Ratio  of  specific  heats,  7  1.4 

Initial  Pressure  (MPa)  0.28 

Initial  Temperature  (K)  293.0 


The  first  simulation  corresponds  to  the  gas  expansion  within  a  150-mm  tube  away 
from  the  effects  of  the  tube  wall  (core-flow)  under  ballistic  conditions.  The  specifications 
for  this  case  are  given  by  the  first  column  of  Table  I.  The  analytic  solution  of  the  one¬ 
dimensional,  inviscid  gas  equations  governing  the  flow  within  this  150-mm  tube  at 
certain  positions  from  the  breach  and  for  specified  times  was  obtained  by  Love  and 
Pidduck.  (See  Reference  [14].)  Their  solution  is  valid  under  the  assumptions  of  isentropic 
expansion  of  each  element  of  gas,  of  constant  covolume,  of  an  even  integer  value  of  the 
ratio  (7  +  l}/( 7  -  1) ,  where  7  is  the  ratio  of  specific  heats,  and  the  frictionless  motion  of 
the  projectile.  The  solution  is  in  terms  of  truncated  power  series,  and  becomes  move 
complicated  as  the  number  of  rarefaction  wave  reflections  increase.  For  the  one- 
dimensional  DELTA  calculation,  the  computational  mesh  (covering  the  enclosed  cavity 
behind  the  projectile)  consisted  of  four  equidistant  mesh  lines  parallel  to  the  a\w  of 
symmetry  and  89  uniformly  spaced  grid  lines  orthogonal  to  the  axis  of  symmetry.  To 
maintain  a  one-dimensional  simulation  for  comparison  to  the  Love  &  Pidduck  results, 
the  following  conditions 
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are  imposed  along  both  the  tube  wall  and  axis  of  symmetry.  The  boundary  conditions 
at  the  breech  and  projectile  are  no-slip  velocity  and  adiabatic  walls.  Love  and  Pidduck 
developed  their  solution  with  the  assumption  of  an  inviscid  isentropic  flow  which  allowed 
them  to  use  a  special  form  of  the  Noble-Abel  equation  of  state,  namely 

7  7 

P  I - i?  =  Pr,  f— —  tj  )  =  constant, 

l  P  1  (  Po  \ 

where  subscript  zero  indicate  their  initial  values.  However,  in  the  DELTA  simulation 
viscous  effects  are  included  and  the  general  form  of  the  Noble-Abel  equation  of  state. 
However,  the  snecial  form  of  the  equation  of  state  used  by  Love  and  Pidduck  b 
maintained  to  within  less  than  two  percent  in  the  DELTA  simulations.  Thus,  the  non- 
isentropic  and  viscid  effects  included  in  the  general  framework  of  DELTA  are  minor  for 
this  one-dimensional  flow,  and  the  comparison  of  the  analytic  solution  and  numerical 
results  is  reasonable. 

The  comparison  of  the  pressure  histories  at  the  projectile  base  is  given  in  Figure  i. 
The  change  in  slope  in  the  pressure  curve  is  due  to  the  first  reflection  of  the  rarefaction 
wave  at  the  projectile  base.  The  magnitude  of  the  slope  discontinuity  cf  the  pressure 
curve  decreases  with  time  due  to  the  equilibration  of  the  pressures  during  the  gas 
expansion.  Another  slope  change  exists  theoretically  at  7.137ms,  although  it  cannot  be 
detected  even  in  the  graph  of  the  analytical  results.  Figure  2  is  similar  to  Figure  1 
except  that  the  pressures  at  the  breech  are  compared  The  effect  of  the  first  arrival  of 
the  wave  at  the  breech  is  ic'-re  obvious  in  both  solutions. 

Figures  C  and  4  show  comparisons  of  the  histories  of  the  projectile  velocity  and 
projectile  displacement  from  the  breech,  respectively.  The  large  values  of  the  tangent  to 
the  curve  in  Figure  3  indicate  the  extreme  acceleration  the  projectile  experiences.  The 
agreement  of  the  results  show  that  the  numerical  solution  of  the  partial  differential 
equations  governing  the  gas  motion  using  the  DELTA  algorithm  is  correctly  ,^upled  to 
the  proper  solution  of  the  projectile  motion.  Comparisons  of  the  pressure  profiles  from 
the  breech  to  cue  projectile  at  specific  1  times  are  given  in  Figures  5  and  6.  The  ranges 
of  pressure  values  on  the  ordinate  are  the  smallest  possible  to  provide  accurate 
compaiisons.  Figure  5  shows  a  comparison  of  the  pressure  values  at  2.898ms,  that  is, 
after  the  rarefaction  wave  has  been  reflected  from  the  breech,  then  the  projectile  base, 
and  is  approximately  halfway  between  them.  The  DELTA  calculation  differs  by  0.6%  at 
most  from  the  analytic  solution  values.  The  slope  discontinuity  is  smeared  out  in  the 
numerical  calculation.  Figure  6  shows  the  pressure  profiles  at  10.23ms  which  is  aear 
tube-exit  time  of  the  projectile.  The  maximum  discrepancy  between  the  two  results  is 
approximately  1.2%.  Because  the  analytic  solution  is  a  truncated  power  series  solution, 
it  is  difficult  to  determine  which  values  are  more  accurate. 

Next  we  compare  a  DELTA  simulation  with  iaser-Doppler  anemor  .try,  time-resolved 
measurements  of  the  axial  velocity  field  inside  a  kibe  behind  a  c’owly  acceleratin6 
projectile.  The  experiment  was  performed  at  Imperial  College,  London  under  funding  by 
European  Research  Office  of  the  Army  and  "he  Ballistic  Research  Laboratory,  and  is 
reported  in  Reference  [15).  This  expeiiment  is  important  to  the  development  of  DELTA 


664 


TL-rji.^.V^  '  J  "J  -Jl -.T -1  ’  V  »'V  •!<  M  f  vw 


gy  V\  V  «•'  V  VI*  1A  1A  IV  l%  l*J  t  V  f  .'■'  I  -v  'A^  .VA  4.T 


because  it  provides  the  first  transient,  two-dimensional  measurements  of  a  quantity  to 
which  the  results  of  a  DELTA  simulation  can  be  compared.  The  schematic  of  the 
apparatus  is  given  in  Figure  7.  Nitrogen  is  the  gas,  and  the  other  characteristic  of  the 
experiment  are  given  in  the  second  column  of  Table  1. 

Three  important  conclusions  of  this  study  are:  The  maximum  intensity  level  of  the 
turbulence  was  approximately  four  percent  which  implies  a  very  low  level  of  turbulence, 
the  tube  wall  boundary  layer  remained  laminar,  and  the  heat  transfer  to  the  tube  wall 
was  minimal.  Consequently,  a  laminar  fiow  simulation  with  adiabatic  walls  should 
approximate  thb  experiment.  The  two-dimensional  computational  grid  had  33  uniformly 
distributed  points  in  the  axial  direction,  and  19  nonuniformly  distributed  points  in  the 
radial  direction.  The  radial  grid  was  such  that,  while  19  points  spanned  the  distance 
from  the  axis  of  symmetry  (r  —  0)  to  the  wall  (r  =  38.35mm),  12  points  were 
distributed  from  r  =  32mm  to  the  wall  and  5  points  were  distributed  from 
r  —  37.85mm  to  the  wall.  The  maximum  and  minimum  distance  between  the  grid 
points  were  6.5mm  and  70pm,  respectively.  Constant  values  were  used  for  the 
coefficients  of  viscosity  and  thermal  conductivity  of  nitrogen,  namely,  17.07p(Pa-s)  and 
O.G2524W/(m-K;,  respectively.  Because  the  experimental  apparatus  was  mounted 
vertically,  the  equa<"'n  for  the  projectile  motion  (2.3)  was  changed  to  include  its 
acceleration  due  to  gravity.  Because  the  total  regarding  force  (FQ  -f  FQ  +  F  in  (2.3)  ) 
experienced  by  the  projectile  was  not  determined  ny  the  experiment,  no  values  of  these 
forces  could  be  assigned.  Thus,  a  total  retarding  force  profile  versus  axial  displacement 
was  obtained  so  that  the  axial  velocity  of  the  projectile  determined  by  the  DELTA  code 
matched  the  experimental  values  as  shown  in  Figure  8.  Because  .-he  projectile  velocity 
values  agree,  so  must  the  projectile  displacement  values.  Figure  9  compares  the  axial 
velocity  profiles  along  the  axis  of  symmetry  at  various  times.  Both  the  DELTA  and 
experimental  results  show  a  linear  profile  from  the  zero  value  at  the  breech  to  the  value 
of  the  projectile  velocity  for  each  time.  The  axial  velocity  histories  at  76.7mm  from  the 
breech  and  at  0.5,  1.0,  2.0,  and  3.0mm  from  the  tube  wall  are  compared  in  Figure  10. 
The  values  from  the  calculation  are  within  the  scatter  of  the  experimental  data  at  radial 
positions  of  1,  2  and  3mm  from  the  wall.  However,  the  discrepancy  between  the  values 
at  the  0.5mm  position  r creases  with  time.  The  same  quantities  arc  graphed  in  Figure 
11  but  at  i 53.4mm  from  the  breech.  The  comparisons  in  Figure  11  show  similar 
behavior  to  that  in  Figure  10,  but  with  the  discrepancy  at  the  0.5mm  position 
considerably  larger.  After  a  discussion  with  the  experimentalists,  it  was  agreed  that 
these  most  difficult,  measurements  at  0.5mm  from  the  tube  wall  are  likely  to  contain 
errors  and  should  be  redone.  We  are  presently  awaiting  accurate  measurements  in  the 
sub-millimeter  range. 


4.  SUMMARY 

The  numerical  algorithm  encoded  in  the  DELTA  computer  code,  and  comparisons  of 
its  calculations  to  an  analytic  solution  and  experimental  measures  are  described. 

A  numerical  algorithm  to  solve  the  two-dimensional,  axisyinmefric,  unsteady,  finite 
volume,  weighted  averaged  two-phase  equations  which  govern  certain  flows  inside  a  gun 
tube  i3  discussed.  These  equations  are  in  their  most  general  form.  In  particular,  when 
the  flow  regime  is  governed  only  by  convection  and  pressure  forces,  this  general  form 
automatically  gives  the  solution  of  the  corresponding  inviscid  equations.  When  in  the 


bound'try  layer  regime,  this  genera!  form  gives  without  any  assumptions,  the  solution  in 
the  boundary  layer  where  viscous  forces  dominate.  Moreover,  this  approach  naturally 
provides  all  tlie  coupling  between  different  phenomena  because  only  one  set  of  equations, 
which  govern  all  the  phenomena,  is  solved.  Thus,  phenomena  that  is  controlled  by 
basically  inviscid  flow  but  exists  because  of  viscous  forces,  like  the  additive  particle  laden 
gun  tube  wall  boundary  layer  which  governs  heat  transfer  to  the  gun  tube,  can  be 
studied  for  the  first  time  without  assumptions  on  the  natures  of  the  core  flow  or 
boundary  layer,  the  validity  of  heat-transfer  correlations,  and/or  the  intra-flow  coupling 
between  various  regimes. 

Two  examples  of  calculations  with  the  DELTA  code  are  presented.  These 
computations  are  limited  to  one-phase  expansion  flows  in  an  adiabatic  tube  so  to  allow 
comparisons  with  independently  obtained  data.  More  realistic  calculations  that  involve 
ballistic  environments,  heat  transfer  to  the  gun  tube  wall,  and  other  phenomena  are 
presented  in  Reference  [16]. 
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Figure  3.  The  comparison  of  the  histories  of  the  projectile  velocity 
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Figure  10. 


The  comparison  of  the  axial  velocity  histories  at 
77,6mm  from  the  breech. 
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(BOPEIKING  EQUATIONS  POE  AXIALLT  SYMMETRIC  PLOWS  II  CTUKOtZGAL  OOOtD  MATES 


This  appendix  contains  a  Hat  of  the  governing  equations  In  component 
fora  In  cylindrical  coordinates  for  the  case  of  axially  sj’aaet rlc  flow.  The 
subscripted  variables  denote  the  components  of  a  vector  and  not  the 
derivative  of  these  variables.  All  derivatives  are  written  in  a  non- 
abbreviated  fora.  The  listed  equations  are  in  a  fora  which  is  compatible 
with  Sq.  (4,32),  Sect.  4.3.  The  components  of  the  gas  average  velocity  and 
the  particle  average  velocity  are 
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where  the  subscripts  r,  6,  and  z  refer  to  tbe  radial,  angular,  and  axial 
coordinate  directions,  respectively.  The  coaponents  of  the  gradient  of  a 
scalar  f  are 


(A.3) 


The  divergence  of  a  vector  F  -  (Fr,  F„ ,  Ft)  is 
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The  independent  variables  are  tiae  t,  radial  position  r,  and  axial 
position  z.  The  dependent  average  variables  which  are  computed  fora  the 
governing  partial  differential  equations  are:  tbe  specific  entropy  »,  the 
pressure  logarithm  function  q,  the  radial  gas  velocity  ur,  the  A 

circumferential  gas  velocity  ^ ,  the  circumferential  particle  velocity  , 

?h*  particle  velocity  u^,  the  number  of  particles  within  the  averaging 

volume  a,  the  regression  distance  d,  and  the  surface  temperature  of  the 
particles  $. 

The  entropy  equation  is 
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where  p,  p,  T,  B,  r,  ere  given  by  Eqs.  (B.6),  (B.4),  (B.2),  (B.27),  end 
(*.26),  respectively.  Ihe  expression,  for  B  is 
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end  the  porosity  a  is  given  by  Eq.  (B.  1).  Ihe  dissipation  function  t  is 
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end  v  „  X,  <♦>,  end  ♦T,  ere  given  by  Eqs.  (B.7),  (B.8),  (B.13),  end  (B.35), 
respectively.  Ihe  heat  conduction  tera  ¥  is  given  by  Eq.  (B. 15)  as 
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^particle*  *  *re  Siven  by  Eqs.  ( B. 17),  (B. 14),  respectively,  end  T* 
discussed  near  Eq.  (B.36).  " 

The  pressure  logarithm  function  equation  is 
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where  p,  e,  T,  B,  e,  B,  T,  ♦,  and  f  are  given  by  Eqs.  (B.4),  (B.3),  (B.2), 
(A.6),  (3.28),  (B.27),  (3.26),  (A.7),  and  (A. 9),  respectively. 


The  radial  gas  velocity  equation  is 

2 


aU'4.^  -iE  111.  f  *M. 
at  ur  lr  u*  a*  r  dqp’Tr  ^ 


3u 


3u 


(A. 13) 


(l~q) 

n 


( Adrag'  r  +  ^  Avis  J  r  +  ( Aturb)  r 


where 


£ 

is 


v 

tf 

V 

V 
a 


« 


3 

L* 

g 

ft 


6 


§ 


K 


E 


& 

V 

I  r 


\c 


to 


i.- 

iV 


LJ 


is 

t: 


•A 

& 


. ..  %  %  .,K 

V;«r  -for 


where 


App.  A 


3u  3u 


3u_  3u 


(4.i»c>*  ■  iz  In  Myr  +  sr^  +  r  tyr  +  sr) 


(A. 18) 


3u_  3u_ 


3u_  3u_ 


+  n  !“"§  ?2  rr  '  yr  ‘  r)  *  aXk r  +  if  *  rW 


and  p,  p,  T,  a,  p,  tod  X  are  given  by  Eqs.  (B.6),  (B.4),  (B.26),  (B.  1), 
(B.7),  and  (B.8),  respectively.  The  axial  component  of  the  drag  1® 

given  by  the  axial  component  of  Eq.  (B.20).  The  axial  component  of  the 
acceleration  due  io  turbulence  could  be  given  by  the  axial 

component  of  Eq*.  (.8.34)  which  Is  Eq.  (A.  1C)  with  p  and  X  replaced  by  and 

xT  • 

The  components  of  rhe  solid  phase  velocity  equation  are  the  radial 
solid  phase  velocity  equation 
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the  circuaferential  solid  phase  velocity  equation 
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and  the  axial  solid  phase  velocity  equation 


3 1  ~  ur  3r  ~  uz  3z  ~  dq  ^  Ifz  +  ^  ^rag^z  +  ^tress^z  (A«21) 

p  o 

where  p  and  p  are  given^by  Eqs.  (B.6)  and  (B.4),  respectively.  The  density 
of  the  solid  phase  p  is  assumed  constant*  The  components  of  the 
accelerations  due  to  drag,  A^  ,  and  intergranular  stress,  ^f;reg8»  arti 
given  by  the  components  of  Eqs.  (B.20)  and  (B.23),  respectively. 

The  particle  number  equation  Is 
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This  copendix  contains  a  lint  of  correlation  model  formulas .  The 
formal  <‘ ere  discussed  in  detail  in  Section  4.7.  The  terms  listed  in  this 
appendix  are  in  a  fora  compatible  with  Eq.  (4.32),  Section  4.3,  and  Appendix 
A. 


The  porosity  or  gas  volume  fraction  (Section  4.2.1)  is  given  by 
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The  equations  of  state  (Section  4.7.1)  are 
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where  R  ■  8.3143  J/(mol*K)  la  the  universal  gas  constant,  M  (kg/mol)  Is  the 
■solar  mas  and  n  (m^/kg)  Is  the  covolume .  The  pressure  logarithm  function  q 
is  defined  by  (Section  4.2.2) 
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Qj 
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q  -  qj[  ln(p/p. )  +  l]  ,  Pa,  or  p  «*  p,exp  (J-  -  l)  ,  Pa  . 

ql  (B.6) 


The  shear  viscosity  coefficient  p  and  the  bulk  viscosity  coefficient 
A  are  (Section  4 .7  <>2) 
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The  acceleration  by  viscosity  is  modeled  by  (Section  4.7.2) 

^viec  "  7  *  {«[ 2uE  +  (X  --|w)  (trace  E)  l]},  a/e2  ,  (B.9) 

where  E  Is  the  etrsln  rata  tensor  computed  using  the  average  velocities, 
l.e. , 

E  -  0.5  (Vu  +  (Vu)T)  .  (B.10) 


The  heat  dissipation  function  tera  is  modeled  by  v Section  4.7.3) 
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and  *T  is  given  by  Eq.  (b.35). 

The  theraal  conductivity  coefficient  k  is  modeled  by  (Section  4.7.4) 
-1.5 
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The  heat  conduction  tera  in  the  governing  equations  is  modeled  by  (Section 
4.7.4) 
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In  Eq.  (B.  19),  c  in  the  particle  eaigftlv|ty,  Og_  *  5.67032*10”®  tf*a“2K”^  la 
the  Stephan- Boltzaan  constant,  and  T  la  the  average  grain  surface 
teaperature.  The  turbulent  heat  flux  within  the  gas,  ?£urb  is  given  by  Eqs. 
(B.36)  and  (B.37). 

The  acceleration  tera  due  to  the  drag  between  gas  and  particles  is 
aodeled  hr  (Section  4.7.5). 
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The  acceleration  term  due  to  Intergranular  stress  la  aodeled  by  (Section 

4.7.6) 
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where  a(a)  is  a  sound  speed  function  for  the  particulate  phase.  The 
function  is  modeled  by 
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The  burning  rate  it-  aodeled  by  (Section  4.7.7) 
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The  source  tera  T  is  (Section  4.7.8) 
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The  enthalpy  factor  H  of  the  source  tera  (Section  4.7.8)  is  defined  by 
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A  tubular  grain  may  be  defined  by  it*  initial  height  end  the  initial 

outer  and  inner  diameters,  D  and  d  ,  respectively.  Let 
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The  grain  la  completely  burnt  if  either  J?-j»  < 0  or  1*0.  If  both  of  these 
quantities  are  positive,  then  the  grain  geometry  functions  are 

vp  -  * (Dq  +  dQ)  (R-r)L/ 2  , 

•p  “  *  (Dq  +  dQ) (R-rrL) 

ap  -  (2 RL  +  %(I?  -  r2;;/2  ,  |(B.33) 

Bp  -  (2R+L  )/2 

Ve  consider  a  detailed  atudy  of  turbulence  models  for  Interior 
ballistics  flows  to  be  outside  the  scope  of  this  report*  Bence,  the 
correlation  models  are  quite  elementary  and  are  listed  in  this  report  only 
for  completeness.  The  acceleration  by  the  gas  phase  turbulent  stress  tensor 
Atur^  and  the  turbulent  heat  ^dissipation  function  f^,  could  have  the  same 
form  e*  Ayigc  (Eq.  (B.9))  and  ♦(2),  (Eq.  (B.12)),  respectively*  but 
different  viscosity  coefficients,  that  is. 
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where  p_  and  1_  denote  the  viscosity  coefficients  for  turbulent  flows.  The 
«nner  in  which  these  coefficients  ere  determined  strongly  depends  an  the 
particular  turbulence  model  one  uses  and,  hence,  will  not  be  given.*  As 
discussed  in  Section  4.7.6,  the  solid  phsse  turbulent  stress  tensor  H^.  is 
set  to  zero.  The  turbulent  heat  flux  vector  Qy  is  aodeled  by  Ishii*3  and 
Gibeling  et  al»3  as 

Qy  -  -  *t[VT  -  |2.  (Tt  -  7)]  ,  W/m2  ,  (B.36) 

<r 

where  is  an  average  temperature  on  the  Interface  (a  function  of  T  and  T) 
and  is  given  by  an  algebraic  forsriia  Involving  an  effective  viscosity  and 
Prandtl  number.  The  corresponding  model  of  fturb  In  E<j.  vB.  15)  is 

*turb  "  "  ~?  ♦  »/(kg«  K)  .  (B.37) 
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ABSTRACT .  Factors  are  discussed  which  govern  evaporation  of  liquid 
in  the  small  capillaries  of  a  porous  medium.  Attention  is  directad  to 
sheet-like  aggregates  from  which  the  vapor  can  escape  with  little 
obstruction.  Marked  temperature  gradients  are  then  found  to  be  confined 
to  close  neighborhoods  of  the  menisci  and  evaporation  is  shown  to  proceed 
in  statiscaily  quite  unstable  configurations  under  a  dynamic  balance  of 
surface  tension,  local  evaporation  rate  and  viscous  shear.  Estimates 
of  evaporation  rates  and  fluid  velocities  are  given.  The  results 
discouicge  constitutive  theories  for  porot'i  media  because  mere  size  of 
capillaries,  independently  of  shape  and  chemistry,  is  found  to  change  the 
physical  processes  underlying  macroscopic  behavior. 

I,  INTRODUCTION.  The  following  study  was  prompted  by  recognition 
that  little  is  known  about  the  physics  of  evaporation  in  fabrics  beyond 
tne  guess  that  the  rate  of  heat  supply  may  equal  the  rate  of  latent-heat 
expenditure.  Fabrics  come  in  a  great  variety  of  very  different  structures 
and  as  a  first  step,  a  structure  characteristic  of  "typical"  porous  media 
is  here  envisaged  in  which  the  solid  matrix  is  threaded  by  an  irregular 
network  of  interconnecting,  small  capillaries  along  each  of  which  the 
capillary  bore  varies  greatly  over  relatively  small  distances.  The 
immediate  challenge  is  then  to  isolate  some  of  the  many  interacting, 
physical  processes  in  a  single  capillary  in  order  to  distinguish  those 
which  really  govern  evaporation  there;  macroscopic  descriptions  must 
needs  reflect  the  insights  thereby  gained. 

A  key  restriction  that  helps  in  dividing  the  difficulties  is  to 
focus  attention  on  sheet-like  media  which  are  Lhin  in  one  oirection,  like 
fabrics,  because  the  escape  of  the  vapor  is  then  relatively  unobstructed 
and  consideration  of  the  processes  in  the  vapor  can  be  postponed  (to 
Section  VIII).  At  first  sight,  evaporation  might  be  expected  to  be 
controlled  by  the  manner  of  heat  supply,  but  for  capillaries  of  realistically 
small  size,  most  forms  cf  heat  supply  have  similar  effects  because  heat 
transfer  across  the  capillary  wai..  is  then  always  important.  Since  the 
physical  signposts  diverge,  unless  one  be  quite  specific,  attention  is 
restricted  to  liquids  similar  to  water,  to  pressures  and  temperatures  typical 

-h  -7 

of  the  outdoors,  and  to  throat  diameters  of  about  10  to  10  cm.  A  final 
dividing  step  is  to  start  with  an  unrealistic  configuration  of  geometrical 
and  thermal  symmetry  in  which  only  a  single  meniscus  needs  to  be  considered 
(Section  III,  IV). 

Analysis  of  the  simple  thermal  balances  for  that  case  shows  that 
significant  temperature  gradients  can  occur  only  very  close  to  menisci 
(Section  IV),  and  a  rough  estimate  of  evaporation  emerges  (Section  IV). 
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It  shows  the  symmetrical  case  to  be  normally  unstable  (Section  V). 

Evaporation  is,  in  fact,  found  to  proceed  in  statically  grossly  unstable 
configurations  under  a  dynamic  balance  depending  drastically  on  viscous 
shear.  The  "Haines  Jumps"  in  the  foreground  of  earlier  accounts  [ 1  -  3 j 
can  occur  only  in  much  larger  passages  than  would  appear  realistic  for 
soils,  oil  recovery  or  fabrics.  Instead,  the  normal  state  in  evaporation 
is  one  of  slow  liquid  motion  leaving  the  small  menisci  almost  stationary, 
while  most  of  the  mass-evaporation  occurs  at  the  .large  ones  (Section  VI). 

These  results  furnish  a  basis  (Section  VII)  for  statistical  estimates  of 
macro-  oic  evaporation  rates,  provided  enough  is  known  about  the 
statisti.  - 1  distribution  of  capillary  throat  sizes.  Such  knowledge 
appears  to  be  an  absolute  prerequisite  for  any  useful  treatment  of  fluid 
motion  in  porous  media  because  viscous  shear  depends  30  violently  on 
throat  size.  As  a  result,  if  two  media  have  the  same  chemistry  and 
identical  shape  for  their  respective  void  passages,  but  differ  in  mere 
geometrical  scale,  then  microscopic  dynamic  balances  C3n  be  quite  different. 
This  discourages  constitutive  theories  of  porous  media,  of  which  invariance 
to  geometric  scale  is  a  basic  premise. 

There  are  other  caveats,  for  instance,  if  the  vapor-air  mixture 
must  pass  through  small  throats,  the  significant  gasdynamicsl  processes 
must  be  anticipated  to  change  the  evaporation  rate  drastically.  On  the 
other  hand,  there  are  also  many  bits  of  luck,  which  make  a  realistic 
fluid  mechancis  of  porous  media  more  accessible.  In  particular,  the 
extreme  magnitudes  of  relevant  combinations  of  physical  parameters  will 
ccme  to  explain,  by  and  by,  why  errors  by  only  a  factor  2,  or  so,  will 
be  treated  so  cavalierly. 

II.  STATIC  EQUILIBRIUM.  When  the  escape  of  the  air-vapor  mixture 

is  unobstructed,  the  pressure  throughout  that  gas  is  effectively  the 

ambient  pressure,  P  .  Admittedly,  since  surface  tension  promotes 
d 

evaporation,  theimal  equilibrium  between  liquid  and  vapour  requires  a 
gas  pressure  at  their  interface  which  depends  on  the  meniscus  curvature. 
Helmholtz* analysis  [4],  however,  shews  this  to  be  a  threshnold  effect,  and 
for  realistic  capillary  bores,  the  thermal  conditions  here  envisaged 
are  well  below  the  threshhold  (5].  With  the  apparent  contact  angle  B 
measured  as  in  Figure  1,  the  pressure  on  the  liquid  side  of  a  meniscus 
is  therefore 

P.  =  P  -  2o/a,  (1) 

x-  a 

where  a  denotes  the  local  capillary  ~odius  and  o  sec  £(  the  surface  tension. 

It  will  be  assumed  that  0  <  B  <  «/2,  for  otherwise,  surface  tension  would 
have  kept  the  liquid  out  of  the  porous  matrix.  Body  forces  will  be  neglected. 
Since  it  is  prohibitive  to  take  account  of  all  the  shapes  of  void-passage 
cross-sections  liable  to  occur  in  s  porous  matrix,  a  will  be  defined  by 

2 

ita  =  area  of  cross-section;  the  error  in  (1)  is  then  one  of  those  treated 
cavalierly. 

By  (1),  a  connected  liquid  column  is  in  static  equilibrium  if,  and  only 
if,  a  takes  the  same  value  at  all  the  menisci  bounding  the  column.  The 
equilibrium  is  stable  if,  and  only  if,  a  does  not  decrease  with  distance 
along  the  capillary  measured  toward  the  gas  side  at  any  of  those  menisci 
(Fig  2). 
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Those  are  merely  local  conditions,  however.  All  kinds  of  void- 
passage  shapes  can  be  expected  in  a  porous  matrix.  Figure  3  indicates 
one  with  a  liquid  column  in  a  position  of  stable  equilibrium.  Suppose, 
however,  that  this  liquid  volume  be  reduced  by,  say,  30?o,  e.g.,  by 
evaporation.  The  remaining  liquid  column  could  not  then  find  a  stable 
equilibrium  position  anywhere  in  the  passage  segment  shown  :  it  must 
move  to  find  a  new,  stable  position  elsewhere.  The  need  for  such 
“Haines  jumps"  [1]  in  the  liquid  configurations  has  dominated  the 
literature  on  the  physics  of  fluids  in  porous  media  [1  -  3]  :  on 
kinematic  considerations,  they  would  be  expected  to  be  sudden  and 
frequent  so  that  fluid  motion  could  appear  as  a  continuous  process  only 
on  a  long-term  average  [3].  Analysis  of  the  dynamics  (Section  VI),  however, 
will  show  that  such  notions  can  be  relevant  only  to  unrealistically  large 
passages. 

III.  THERMAL  BALANCE.  Prolonged  evaporation  must  depend  on  an  external 
heat  supply,  and  it  might  be  anticipated  that,  not  only  the  rate^  but  also  the 
manner,  of  that  supply  has  a  major  influence  on  the  evaporation  process. 
However,  since  the  ratio  of  capillary  volume  to  surface  area  is  proportional 
to  the  capillary  bore,  heat  transfer  across  the  capillary  wall  is  always 
important  in  small  capillaries.  That  transfer  will  be  found  in  section  IV 
to  cause  an  adjustment  of  temperatures  in  the  fluid  and  solid  that  reduces 
the  influence  of  the  manner  of  heat  supply.  To  fix  the  ideas,  it  will  be 
envisaged  that  a  reservoir  supplies  it  to  the  solid  matrix,  in  the  first 
place,  so  as  to  heat  it  gradually,  from  the  original,  uniform  temperature 
Tg  of  the  whole  medium,  which  may  be  considered  to  be  known,  to  a  level 

T^  at  which  it  is  then  maintained.  Specifically,  TQ  may  be  the  outdoor 

temperature,  and  T^  may  be  near  body-temperature,  perhaps  T^  »  T^  +  50°f. 

The  properties  of  the  solid  arc  outside  the  scope  of  this  study  and  will 
be  assumed  uniform.  This  does  not  imply  a  temperature  field  constant  in 
time  and  uniform  in  space,  because  evaporation  makes  the  menisci  act  like 
moving  heat  sinks  of  changing  strength,  but  the  space-average  T  (  of  the 

capillary  wall  temperature  will  change  only  slowly  with  time.  The  stronger, 

local  variation  in  capillary  wall  temperature  can  be  accounted  for 

approximately  by  rough  adjustments  [5]  and  meanwhile,  the  considerations  can 

be  simplified  by  ignoring  the  difference  between  T  and  the  local  temperature 

w 

of  the  capillary  wall.  That  will  lead  to  estimates  of  evaporation  at 

menisci  in  terms  of  1  ,  whence  estimates  of  the  time  development  of  T  by 

w  w 

more  global  considerations  will  follow. 

Since  little  cun  be  known  about  the  shapes  of  realistic  void-passage 
cross-sections,  the  description  of  evaporation  will  be  simplified  greatly 
by  representing  quantities  in  the  fluid  by  their  averages  over  a  capillary 
cross-secticn  and  ignoring  the  errors  resulting  from  use  of  somewhat 
different  averages  in  different  contexts.  The  thermal  description  is  also 
greatly  simplified,  if  no  liquid  motion  couples  the  processes  at  different 
menisci.  I  fiat  is  possible  in  the  idealized  case  of  a  capillary  of  radius 
a(x)  even  in  distance  x  along  the  capillary,  if  the  temperature  field  is 
similarly  even  and  liquid  fills  a  capillary  segment  between  menisci  at 
x  =  t  Z,  with  a'(f.)  >  0  for  static  stability  (Fig  2a).  Such  symmetry  can 


persist,  et  least  in  principle,  and  will  be  aas>jmed  in  this  Section  and 
the  next  one. 

The  local  thermal  balance  per  unit  length  is  then 


2  9  2  3Ta 

"a  SC£9F  =  i?(“^)+2wh(TW-T£)» 

where  the  lefthand  side  represents  the  local  rate  of  increase  of  liquid 
heat  content;  p  and  c  denote  density  and  heat  capacity,  respectively, 
and  the  suffix  £  distinguishes  liquid  properties.  The  first  term  on  the 
righthand  side  represents  the  contribution  from  heat  conduction  in  the 
liquid,  and  the  last  term,  that  from  heat  transfer  across  the  capillary 
wall  on  the  most  rudimentary,  conventional  model  of  a  heat  transfer  rate 
per  unit  wall  area  and  unit  temperature  difference  represented  by  a 
constant  transfer  coefficient  h.  With  the  insignificant  further 
approximation  of  neglect  of  variations  in  X,  p^  and  c^.  the  balance 

becomes 


9_ 

9x 


<»2£> 


+ 


2h 

P£c£a 


(2) 


where  k  denotes  the  usual  heat  diffusivity,  x  =  X/CpgC^).  The  liquid 

therefore  experiences  a  typical  heat  conduction  process  with  variable 
effective  diffusivity,  on  account  of  the  capillary  shape,  and  with  heat 
transfer,  but  without  convection,  on  account  of  the  symmetry. 

A  somewhat  different  balance  arises  at  a  meniscus.  Since  the  liquid 
and  gas  are  there  envisaged  in  dew-point  equilibrium  to  begin  with,  and 
since  the  gas  pressure  remains  at  the  ambient  level  pg  until  Section  VIII, 

any  heat  reaching  the  meniscus  will  result  meanwhile  in  evaporation,  but 
not  [2]  in  a  change  of  the  local  temperature  TM  from  its  original  level 

Tg.  Conduction  through  the  liquid  column  in  x  <  1  contributes  heat  to 

the  meniscus  at  the  rate 

•>>*»  • 

if  the  meniscus  is  at  x  -  £(t)  and  a  denotes  the  capillary  radius  a(£) 

Irt 

there.  Conduction  through  the  gas  in  x  >  £  does  not  contribute  comparably 
because  its  heat  conductivity  is  omaller.  If  the  meniscus  is  markedly 
curved,  heat  reaches  it  also  by  direct  transfer  across  a  short  segment 
of  the  capillary  wall  and  radial  heat  conduction.  The  wall  area  from  the 
meniscus  contact  line  to  the  position  of  its  apex  (Fig  i)  is  approximately 

2 

2i»a  a,  with  0  <  a  =  secB  -  tan3  <  T.  The  rate  of  heat  transfer  across 
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this  area 

2na2  a.hM(T  -  IM)  , 
m  2  M  w  M  ’ 

where  t>M  denotes  a  value  of  the  transfer  coefficient  h  adjusted  [5]  to 

compensate  for  the  error  made  by  confusing  the  local  wall  temperature 
at  the  meniscus  with  its  level  further  away. 

Let  S  denote  a  short  capillary  segment  of  fixed  length  which  is 
stationary  in  a  frame  moving  with  the  local,  liquid  velocity  and  which 
contains  the  meniscus  at  present.  The  gas  pressure  and  temperature  are 
constant  in  S,  and  if  0£/Dt  denotes  the  velocity  of  the  meniscus  in  that 
frame,  the  mass-rate  of  evaporation  in  S  is 

m  =  -ira^2  p?  D2/Dt  .  (3) 

The  mass  of  gas  in  S  does  not  increase  at  a  significant  rate  because  tne 

density  ratio  p  /'p.  is  about  10“^,  in  the  circumstances  envisaged,  so  that 
g  x, 

vapour  leaves  S  at  the  same  mass-rate,  and  the  net  rate  of  mass  loss  in 
S  is  also  m,  since  no  liquid  enters  S.  By  the  First  Law,  the  net  rate  of 
liquid  enthalpy  loss  in  S  equals  the  rate  of  vapor  enthalpy  loss  from  it 
less  the  rate  of  heat  addition  by  transfer,  and  conduction  into  S, 

na  2[?.a  hM(T  -  T  .)  -  XDT„/3x]  =  ml,  (4) 

m  Z  n  w  li  Jl 

where  L  is  the  latent  heat  per  unit  mass. 

IV.  THERMAL  LAYER.  The  use  of  these  balances  requires  a  nondimensional 
notation,  and  it  is  njt  obvious  whether  a  single  length  scale  X  can  be 
representative  of  the  temperature  field  throughout  a  liquid  column.  Even 
without  attention  to  cross-sectional  shape,  the  capillary  is  described 
by  the  two  functions  a(x)  and  a'(x)/a(x)  of  normally  quite  different 
magnitued  and  each  of  which  may  vary  by  orders  of  magnitude  along  a 
capillary.  To  avoid  confusion,  let  attention  be  confined  first  to  a 
liquid  column  segment  adjacent  to  the  initial  meniscus  position  x  =  2.(0) 
and  short  enough  to  occupy  only  a  capillary  segment  charrcterired  by  a 
single  triplet  of  scales  of  the  capillary  radius,  G,  of  a(x)/a'(x)  ar.d  X, 

of  the  unknown  temperature  variation.  But,  there  is  another  thermal  length  scale, 

A  =  (iAa0/h)^  , 

2  2 

of  decisive  significance  because  A  /X  represents  the  ratio  of  the  thermal 
diffusion  scale  to  the  transfer  scale.  How  is  X  related  to  the  other 
three  length  scales? 

Tne  present  model  can  give  no  information  on  how  A  compares  with  a^ 

and  G,  but  a  more  detailed  calculation  [5]  shows  that  A  must  be  anticipated 
to  be  of  the  order  of  the  capillary  radius,  so  that 

A/a^  =  tX/(2agh)]^  s  y^ 

is  a  parameter  of  order  unity;  a  rough  estimate  [3]  is  y^  =*  Accordingly, 

X  =  3g,  unless  this  be  found  to  imply  that  T^( x )  can  vary  only  on  a  longer 


699 


scale,  The  natural  temperature-difference  scale  is  T  -  Tu  •=  A,  and  if 

w  “ 

-  Tt(x,t)  =  T ( x , t )  t>  , 

t  is  measured  in  units  of  a  time  scale  x,  a/a*  in  units  of  G,  and  x, 
a  and  l,  in  units  of  X  =  aQ,  then  the  nondimensional  form  of  (2)  and 

(4)  is 

„  2  , 

_Q_iI  _  lil  a0  II  .  3  T  _  1  .  T  m 

«r  3t  a  IT"  3x  Y^a  ^ 


H  +  _L.-  di  t  .  1(t) 

ax  2  '  &hx  dt  at  "  u;’ 

Yh 

respectively,  because 

rot,  t)  =  i 

in  this  notation  and  because  the  liquid  is  at  rest;  here  =  a^h^/t,  and 
e  =  CgA/L  «  1 

in  the  circumstances  here  envisaged,  eg,  e  =  1/20  for  water  and  A  =  50°F. 

Two  different  time  scales  have  emerged  from  the  balances.  The  shorter, 

aQ  /x,  characterizes  transients  arising  from  imbalance  of  heat  transfer  and 

conduction  which  might  be  anticipated,  eg,  in  a  Haines  jump.  It  is  normally 
a  rather  small  fraction  of  a  second,  eg,  if  h  =  10”^cm2/sec  and  a^  =  1D_zcm, 

2  -1 

then  3g  /h  =  10  sec.  The  motion  of  the  meniscus,  on  the  other  hand,  is  on 
the  longer  time  scale 
aQ2/(xe)  . 

The  first  question  must  be  whether  a  relatively  stable  evaporation  process 
is  possible,  and  to  ex-  ~ine  this,  the  time  scale  x  must  be  identified  with 

2 

Qg  /(ne).  if  also  a^/G  «  1,  as  one-  would  usually  expect,  then  the  lefthand 
side  of  (5)  becomes  unimportant  by  comparison  to  the  righthand  terms,  in 
which  a-1  ®  1,  to  the  same  approximation.  The  balance?  then  imply 


T  =  exp 


X-il(t) 


£(t)  =  1(0)  -  t 


[exetp4  when  the  capillary  segment  under  scrutiny  contains  x  =  0,  in  which 


■nfimnw  jw* 


n  -t  j.  '  ‘jaKaKwzroPf^R*  *  *>*  - 


case 


2  -*(t)/Yh 


cosh(x/Yh>]. 


This  solution  describes  a  very  short  meniscus  layer,  of  thickness  equal  to 
the  thermal  length  scale  A  =  Y^a^,  in  which  virtually  the  whole  process  of 

heat  transfer  to,  and  heat  conduction  in,  the  liquid  takes  place. 

This  conclusion  destroys  the  analysis  sketched  so  far  because  one  of 
its  main  premises  —  that  thermal  balances  can  be  formulated  in  terms  of 
cross-sectional  averages  —  cannot  apply  to  precisely  the  short  capillary 
segment  containing  the  curved  meniscus  and  all  significant  temperature 
gradients!  Any  tenable  analysis  of  the  temperature  field  must  account 
for  the  geometry  cf  the  meniscus,  but  that  depends  mainly  on  matters 
accessible  only  to  vague  speculation,  at  Dest,  namely  the  shape  of  the 
capillary  cross-section  and  the  apparent  contact  angle. 

If  a  tenable  analysis  could  be  performed,  on  the  other  hand,  it 
would  necessarily  lead  to  the  same  dimensional  groups  and  would  therefore 
also  predict  a  dimensional  meniscus  velocity  of  the  form 

Y  —  .  Y^ 

ao  V? 

and  a  dimensional  evaporation  time  of  the  form 


Y~^0ag/(K£) 

for  a  capillary  segment  of  length  aQ  What  has  no  rational  support  is  the 
2 

value  (Yh+  aj)/Yh  of  the  nondimensional  coefficient  y  predicted  by  (6). 

Thought  about  extreme  cases  indicates  however,  that  the  correct  value  of 
Y  cannot  plausibly  be  far  from  order  unity  and  indeed,  that  y  =  3,  A  or  5 
cannot  usually  be  very  wrong  To  fix  the  ideas,  therefore,  the  value  y  -  ^ 
will  be  adopted  speculatively  for  illustration.  For  water  (with 


m  *  0.0014  cm  /sec)  and  e 
give  roughly  the  meniscuo 
following  table. 

=  1/20,  various  capillary 
velocites  and  evaporation 

radii  and  lengths  then 
times  listed  in  the 

TABLE  J 

a^Ccm) 

10"2 

io-’ 

io“4 

AcH/aQ 

3 

es 

i 

o 

T— 

X 

3  x  10-1 

3 

cm/sec 

*0 

10'1 

CM 

1 

o 

1 

o 

-2  -3 

10  L  10 

cm 

Vo/(aeK) 

3 

1  1 

3  30 

3  x  10~3  3  x  10“4 

sec 

In  sum,  the  analysis  has  been  wrong  in  everything  but  its  result. 

Most  of  all,  what  has  been  proven,  if  only  by  contradiction,  is  that  the 
temperature  variation  associated  directly  with  evaporation  from  a 
meniscus  must  be  quite  local.  It  follows  that  the  main  results  are  also 
independent  of  some  other  premise^  There  can  be  no  significant, direct 
thermal  interaction  between  different  menisci  even  if  the  liquid  moves 

2 

on  the  time  t,cale  a^  /(kg);  the  meniscus  velocity  should  then  be 

interpreted  as  that  of  the  meniscus  relative  to  the  adjacent  liquid  and 

an  must  represen*-  the  scale  of  the  capillary  radius  a  at  the  instanta- 
u  m 

neous  meniscus  position.  The  manner  of  heat  supply,  moreover,  can  have 
less  direct  influence  on  the  local  process  than  might  have  been  thought 
at  first  :  apart  from  the  local  dip  in  solid  temperature  at  the  meniscus 
(accounted  for  by  a  correct  value  of  y)  the  solid  temperature  background 
of  a  capillary  must  reflect  the  macroscopic  scale  of  the  solid  matrix 
as  a  whole  and  must  therefore  appear  effectively  uniform  on  the  length 
scale  a^  of  the  local  evaporation  process. 

In  the  first  place,  (6)  applies  only  to  a  capillary  segment  in  which 
the  radius  differs  by  less  than  an  order  of  magnitude  from  that  at  the 
initial  meniscus  position.  Once  evaporation  has  cleared  that  segment, 
however,  an  analogous  calculation  with  a  different  scale  applies  to  the 

nexL  segment.  Since  narrow  segments  are  seen  to  clear  in  a  much  snorter 

time  than  wide  ones,  it  does  not  appear  worth  entering  here  upon  the 

refinement  of  replacing  an  from  the  start  by  the  capillary  radius  a  (t) 

u  m 

at  the  meniscus. 

V.  INSTABILITY.  Before  evaporation,  all  menisci  bounding  a 
connected  liquid  column  must  be  of  the  same  size  (Section  II),  but  in  a 
realistic  porous  medium  a'(x)  cannot  also  be  expected  to  have  the  same 
value  at  different  such  menisci.  The  meniscus  velocity  (Section  IV)  ther. 
takes  the  same  value  at  all  the  initial  menisci  positions,  but  if  they 
all  started  to  move  with  it,  static  equilibrium  would  be  lost  promptly. 
Surface  tension  acts  towards  restoring  it,  but  the  differences  in  meniscus 
velocity  relative  to  the  liquid  act  in  the  opposite  sense.  Stability  of 
evaporation  therefore  poses  a  question  different  from  that  of  static 
stability  (Section  II). 

To  examine  it,  consider  a  liquid  column  bounded  by  two  menisci  at 
which  the  gas  pressure,  p  ,  and  temperature,  T  =  T  ,  remain  the  same, 
but  at  which  the  capillar^  radii  differ.  To  illuminate  the  distinction 
from  static  stability,  suppose  a'(x)  is  monotone  over  the  whole  liquid' 
filled  capillary  segment,  which  contains  a  throat  (Fig  4).  Denote  the 
meniscus  positions  by  x  =  4+  and  x  =  4  <  4+  and  the  capillary  radii 

there,  by  a(4+)  =  a+  and  a(4_)  =  a_  <  a+,  respectively;  ie,  the  smaller 

meniscus  is  at  the  lefthand  end  of  the  liquid  column.  By  (1),  surface 
tension  generates  a  pressure  difference 


8  —  3 

p(£.  )  -  pU  )  =  2a  — - : 


driving  the  liquid  towards  the  left. 

If  a'(x)  is  not  too  large,  the  viscous  shear  generated  by  the 
ensuing  liquid  motion  sets  up  a  pressure  gradient  related  approximately 
by  Poiseuille's  formula 

g  =  - a4 

8u  dx 

to  the  mass-flow  rate  Q  (counted  towards  the  right),  which  is  independent 
of  x,  by  mass  conservation.  Since  liquid  density  variation  is 
insignificant. 


P«U  -  pU_)  r  -M  J*+  [a(x)3-4  dx, 


and  since  the  main  contribution  to  this  integral  arises  from  the  throat 
region,  it  promotes  clarity  to  write  the  integral  as  fc^/a^  in  terms  of 

the  throat  radius  a^.  and  a  "throat  length"  The  mass-flow  rate 

generated  by  surf ice  tension  is  then 


Q  =  - 


tpct  ft_  f+J  a 
AU  a+a_ 


and  the  corresponding  cross-sectional  averages  of  liquid  velocity  are 

2  2 
Q/(npa+  )  at  the  right  meniscus  and  Q/(~pa  ),  at  the  left  one. 

Evaporation,  on  the  other  hand,  retracts  the  menisci  into  the  liquid 
with  velocities  yxc/a^  and  yne/a  ,  respectively  (Section  IV).  The  center 

of  the  liquid  column  therefore  shifts  at  the  rate 

2  2  2  2 

u  *♦  *  *-  m  a.  - a.  r  r  at  i  a.  * a.  i 
dt  2  =  2  v>-  l T "  U.a_J  J  ’ 

where 

Ce  =  exp/Coa^) 

is  a  capillary  number  based  on  evaporation  velocity  and  is  normally  very 
small,  if  the  apparent  contact  angle  is  not  close  to  90°  (fig  1);  for 

water,  eg,  Aa^t'e  ~  10"  cm.  The  factor  of  Ce  in  the  last  bracket, 

however,  tends  to  be  evan  smaller,  as  long  as  a  and  a  remain  large 


compared  to  the  throat  radius  a^..  During  such  a  phase  of  evaporation, 

the  liquid  column  therefore  shifts  towards  the  right,  ie,  in  the 
direction  opposite  to  that  suggested  by  surface  tension  alone.  Any 
statically  stable  liquid  configuration  with  am  >>  a^  is  therefore 

unstable  under  evaporation. 

If  the  menisci  were  found  close  to  the  throat,  on  the  other  hand, 
in  the  last  stage  of  evaporation,  then  the  second  term  in  the  last 
bracket  would  be  large,  the  liquid  column  would  move  leftward,  and 
the  smaller  meniscus  would  move  away  from  the  throat. 

In  sum,  most  of  the  evaporation  must  be  anticipated  to  occur  in 
liquid  configurations  that  are  not  static  equilibria,  and  there  might 
be  preferred  positions  for  the  smallest  menisci. 

VI.  DYNAMIC  BALANCE,  for  a  realistic  impression  of  evaporation  in 
porous  media,  one  must  therefore  consider  liquid  configurations  far 
from  static  equilibrium,  for  instance,  such  as  indicated  in  Figure  5, 
which  envisages  a  situation  that  might  be  seen  in  a  snapshot  of  a 
"Haines  Jump"  after  evaporation  has  made  one  meniscus  clear  a  throat. 

The  disparity  of  the  meniscus  sizes  then  generates  a  marked  pressure 
difference  driving  the  liquid  towards  the  smaller  meniscus,  and  an 
unsteady  liquid  motion  must  be  anticipated.  There  are  two  very  small 
time  scales,  namelv  the  liquid  column  length  divided  by  the  sound  speed 

2 

in  it  and  the  time  scale  afc  /v  of  viscous  diffusion  of  shear  from  the 

capillary  wall  in  the  throat  region  (Fig  5),  which  is  about  ICf^sec  in 

water,  if  a,  ~  TO'^cm.  The  evaporation  time  scales  (Section  IV)  are 
much  longer^  and  the  full  viscous  shear  must  therefore  be  expected  to 
nave  been  established,  particularly  in  the  threat  region.  The  drastic 
degree  to  which  thJs  viscous  shear  in  small  capillary  throats  will  be 
seen  presently  to  control  evaporation  illustrates  the  reason  for  the 
prominence  of  Darcy's  law  in  porous  fluid  mechanics. 

The  pressure  imbalance  drives  a  mass-flow  rate  Q(t)  <  0,  since  it 
is  directed  towards  the  smaller  meniscus  (Fig  5).  At  the  same  time, 
the  menisci  retract  into  the  liquid  with  their  respective  evaporation 
velocities  and  "yHe/a  (Section  IV).  It  will  promote  clarity, 

nr 

and  help  to  distinguish  the  more  generic  case  from  that  discussed  in 
the  preceding  Section,  to  exploit  the  disparity  in  meniscus  sizes 
(Fig  5)  to  the  degree  of  neglecting  a  against  a+.  The  larger  meniscus 

2 

is  then  considered  to  move  just  with  the  velocity  dx+/dt  =  Q/(npa+  ), 
but  the  smaller,  to  move  with  the  velocity 


dx  /dt  =  yxe/a  +  Q/(Ttpa 


relative  to  a  fixed  frame.  The  pressure  difference  is  now  approximated 
as  p  -  p  =  2o/a  and  the  total  shear  stress  is  -8p  QH. /( npa. ^ ) ,  in 


term--  of  the  "throat  length"  of  Section  V.  In  small  capillaries, 

the  pressure  drop  and  viscous  shear  cannot  come  into  significant 
imbalance,  and  therefore, 

2o/a_  =  -8y  Qi^/Ufa^). 

This  is  the  microscopic  version  of  Darcy's  law  for  liquid  motion  driven 
through  a  small  capillary  by  surface  tension  osecB  at  menisci  of 
disparate  size.  Since  it  relates  Q  to  a_,  (7)  may  be  written 

2 

dx_  p  0  /•  a 

3-  dt  [  1  "  4YeHy&t  [  a_ 

and  since  da/dx  <  0  at  the  position  x  =  x  (t)  of  the  smaller  meniscus 

(Fig  5),  this  shows  the  approximate  dynamics  to  be  represented  by  an 
equation  of  the  structure 

C1  Tit  (a-2)  =  ^  =  -2yea'(x_)  >  0,  (8) 

a 

2 

and  with  increasing  time,  a  must  approach 
c2  =  oat4/(4Yexy£t)  . 

This  does  not  represent  a  strict  equilibrium  because  T^,  and 

therefore  also  e,  may  change  slowly  with  time,  and  in  any  caje,  the 
liquid  keeps  moving,  but  only  a  minor  drift  of  the  smaller  meniscus 
results  tnerefrom.  In  terms  of  a  hybrid  capillary  number 

Ct  =  £Ky/(oat) 

based  partly  on  evaporation  and  partly,  on  throat  radius,  the  smaller 
meniscus  remains  close  to  the  position  where 

a  /at  ~  (4YCtXt/at)~^  .  (9) 

The  capillary  radius  at  the  smaller  meniscus  is  thus  seen  to  depend 

2 

most  of  all  on  the  throat  radius,  indeed,  to  be  proportional  to  a^  ,  on 

account  of  the  dominance  of  viscous  shear  in  small  capillary  throats. 

For  a  rough  impression,  for  water,  e  =  1/20,  a^  =  10~^cm  and  sec  8  =2, 

the  hybrid  capillary  number  is  Ct  ~  1C”^j  a  value  15  of  4y  is  unlikely 
to  be  wrong  by  a  large  factor,  and  if  2^  -  20a^,  then  the  last  formula 


predicts  a  /a^  ~  20.  Figure  5,  accordingly,  gives  a  reasonable  impression 

of  a  typical  configuration.  Table  I  gives  an  impression  of  the  liquid 

-3  -2 

velocity  s  for  a^.  =  10  cm,  eg,  it  is  only  about  10  cm/sec  at  the 
smaller  meniscus,  and  the  velocity  dx+/dt  of  the  larger  meniscus  relative 
to  a  fixed  frame  is  even  less. 

Most  of  the  mass-evaporation,  on  the  other  hand,  cccurs  at  the 
larger  meniscus  (Fig  5)  because  its  area  is  larger;  by  (3),  it  is 


m  =  Trypnea  =  irya  X( T  -  Tu)/L  .  (10) 

For  water  and  a^  =  1mm,  eg,  it  would  be  about  10~^gr/sec.  In  turn,  an 

impression  of  the  dependence  of  the  wall-temperature  level  T^  upon  the 

external  heat  supply  begins  to  emerge,  because  the  rate  Qh  of  that 

supply  per  meniscus  is  mL.  In  a  liquid  column  bounded  by  just  two 
menisci  of  disparate  size,  the  smaller  expends  relatively  little  of 
this,  so  that  (10)  shows  the  external  heat  supply  to  such  a  column 
to  depend  only  on  Tw  and  cn  the  capillary  bore  at  the  larger  meniscus. 

The  critical  importance  of  capillary  size  merits  re-emphasis 
here.  The  pressure  difference  due  to  surface  tension  is  proportional 

to  am_1,  and  so  is  the  meniscus  velocity  relative  to  the  liquid,  but 

the  pressure  drop  due  to  viscous  shear  is  proportional  to 

Accordingly,  if  two  porous  samples  be  compared  which  are  identical  in 
regard  to  chemistry  and  to  shape  of  the  void-passages,  but  differ  by  a 
factor  10  in  the  size  of  those  passages,  then  the  dynamic  balances  for 
them  differ  by,  essentially,  a  factor  10^  and  therefore,  the  fluid 
physics  in  the  two  samples  may  be  quite  different.  That  contrasts 
strongly  with  constitutive  theories  of  porous-media  mechanics,  of  which 
invariance  under  mere  change  of  geometric  scale  is  a  main  premise. 

It  appears  doubtful,  therefore,  that  a  substantive  description  of 
fluid  mechanics  in  porous  media  is  obtainable  without  some  insight  into 
the  fluid  dynamics  on  the  microscopic  level,  whence  macroscopic 
behavior  must  spring. 

VII.  MACROSCOPIC  IMPLICATIONS.  For  an  impression  of  global 
evaporation  An  a  porous  medium,  a  thermodynamically  steady  phase  may 
need  to  be  distinguished  from  an  initial,  transient  phase.  The  later 
phase  is  characterized  by  essential  equality,  at  any  time,  of  the 
rates  of  external  heat  supply  and  of  latent-heat  expenditure.  The 
global  mass-rate  of  evaporation  is  then  immediately  known  and  a  lower 
bound  Tg  of  total  evaporation  time  can  be  deduced  as  that  which 

evaporation  of  the  initial  liquid  mass  would  need  under  such  conditions. 
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Whether  such  a  late  phase  occurs  at  all,  or  at  the  other  extreme, 
whether  the  transient  phase  is  of  no  importance,  must  he  judged  from 
comparison  of  Xg  with  the  time  scale  of  the  transient  phase.  The  latter 

may  be  one  of  three  scales,  of  which  the  first,  ,  cnaracterizes  the 

rate  at  which  the  external  reservoir  can  communicate  heat  to  the  porous 
aggregate  and  a  second,  characterizes  the  rate  at  which  heat  can 

be  distributed  through  the  solid  matrix.  Neither  of  these  is  within 
the  scope  of  this  account,  but  the  time  scale  x^  of  liquid  response 

to  the  heat  supplied  to  it  can  be  predicted  on  the  basis  of  the  present 
results,  if  adequate  knowledge  of  the  statistical  distribution  of  void- 
passage  sizes  is  at  hand. 

Indeed,  if  even  a  relatively  small  number  of  large  passages  thread 
the  porous  aggregate,  then  all  the  "action"  will  occur  in  them,  and 
Haines  Jumps  may  be  there  observable,  while  the  rest  of  the  medium  remains 
essentially  inert  until  the  by-passes  have  cleared  to  an  extent  making 
them  effectively  a  part  of  the  outer  boundary  of  the  medium. 

On  the  other  hand,  if  enough  passage  throats  of  sufficiently  small 
size  are  distributed  sufficiently  well  through  the  aggregate,  then  they 
will  anchor  the  liquid  and  permit  only  creeping  motion.  The  time  scale 
T-j  would  then  be  expected  to  be  essentially  that  of  transition  from 

static  stability  to  dynamic  balance,  which  (8)  shows  to  be 

x3  =  c^g/Ue)  =  (ag/e)2/[2Yn|a'(x_)|]. 

Since  it  is  seen  to  depend  most  of  all  on  ag  and  e,  a  useful,  macroscopic 

estimate  of  this  scale  requires  both  a  judicious  choice  of  e  between 
0  and  its  level  in  the  steady  stage,  and  also  a  statistically  valid 

2 

measure  na^  of  the  cross-sectional  areas  of  small  throats,  whence  the 

corresponding  measure  ag  of  capillary  radii  at  the  small  menisci  can 

be  deduced  by  (9).  For  a  very  rough  impression  (which  may  well  be  mis- 

_2 

leading  xn  regard  to  specific  cases),  if  e  =  1/50,  ag  =  10  cm  and 

|a'(x  )|  =  10  were  appropriate,  then  for  water,  x^  ~  4  min. 

If  x3  should  turn  out  to  be  much  longer  than  Xg,  x^  and  X£,  the 

microscopic  dynamics  of  the  main  phase  of  e>  iporation  would  be  that 
described  in  Section  V.  The  stability  analysis  there  given  is  not  linearized, 
and  its  results  therefore  apply  to  the  whole  transition  from  static 
stability  to  dynamic  balance.  Of  course,  once  the  initial  configuration 
of  static  stability  has  been  left  well  behind,  the  much  simpler  approxima¬ 
tion  of  Section  VI  becomes  adequate.  The  translation  here  of  the  micro¬ 
scopic  description  into  a  predictive  algorithm  on  the  macroscopic  scale 


may  be  premature,  however,  because  its  usefulness  is  likely  to  depend 
critically  on  a  more  precise  knowledge  of  the  statistical  distribution 
of  void-passage  sizes  than  appears  available  to-date  for  any  real  sample. 

VIII.  VAPOR  idANSPORT.  For  evaporation  in  dynamic  balance,  the 
time-dependence  of  the  processes  in  the  air-vapor  mixture  may  also  be 
expected  to  amount  to  no  more  than  a  slow  drift  leaving  the  processes 
quasi-steady.  Transfer  from  the  capillary  wall  will  heat  this  gas, 
but  if  the  pressure  differences  in  it  are  insignificant  because  no  small 
throats  obstruct  its  passage,  then  the  attendant  density  change  will 
not  be  worth  accounting  for,  at  the  temperature  levels  here  envisaged. 

2 

Accordingly,  tne  volume  flow  rate  ita  u  of  gas  will  also  be  considered 
constant  along  the  passage. 

The  mass  evaporated  consists  of  vapor,  but  the  gas  flow  moves  the 
air-vapor  mixture  and  must  therefore  be  accompanied  by  diffusion  of 
vapor  and  air  into  each  other.  The  gas  at  the  meniscus  must  be  at  its 
dew  point,  so  that  the  partial  vapor  pressure  there  is  the  saturation 
pressure  at  the  meniscus  temperature  T^.  For  water  vapor,  e..g.,  that 

partial  pressure  is  about  1/AO  (or  3/40)  at  T^  -  293  (or  313)°  K,  and 

the  gas  even  at  the  meniscus  then  consists  almost  entirely  of  air. 

Since  the  partial  vapor  density  is  even  smaller  [4],  the  diffusion  of 
the  vapor  is  adeauately  approximated  by  the  standard,  linear  model  cf 
Fick's  law,  j  =  -  0Vpv,  for  the  vapor  flux.  With  unsteadiness  already 

neglected,  the  same  mass-flow  rate  of  vapor  mus^  cross  every  capillary 

cross-section,  so  that  Tia  upv  -  na  6dpv/dx  is  independent  of  x.  It 

follows  that 

pv  '  pvm  (^e)/q. 

-  =  e 

P  -  p 
ve  vm 

where  subscripts  m  and  e  distinguish  respective  values  at  meniscus  and 
capillary  exit, 

5  "  \  \  [a(s)]"Z  ds, 

J  x 

m 

and  q  =  0/um  is  a  diffusion-length  s  ale  based  on  the  gas  velocity  um 

at  the  meniscus.  Most  of  the  diffusion  therefore  occurs  within  a  £- 
distance  q  of  the  exit,  by  contrast  to  the  heat  transfer,  most  of  which 
occurs  fairly  close  to  the  meniscus. 

The  process  can  be  radically  different,  however,  if  the  gas  must 
pass  through  a  small  throat.  Let  u  denote  again  the  cross-sectional 
average  of  the  gas  velocity  and  let  subscripts  £,  g,  v,  a,-m  and  t  distin¬ 
guish  reference  to  the  liquid,  gas,  vapor,  air,  meniscus  and  throat, 
respectively.  Then  from  the  estimate  of  meniscus  velocity  relative 


to  the  liquid  in  Section  IV,  a  u  =  yep.p./p  ,  approximately,  and  since 

mm  *>  *«  v 

this  is  independent  of  meniscus  size,  so  is  the  Reynolds  number  Rem  = 

2 

=  a  u  /v  of  the  gas-flow  at  the  meniscus.  Since  v  *»  0.15  cm  /sec 
m  m  a  a 

and  Pg/Pv  ~  10”*  for  water,  Table  1  (Section  IV)  indicates  Reffl  =  2  to 

2 

be  a  rather  typical  value.  Tne  mass- flow  rate  m  =  na  p^u  is  independent 

of  x  in  near-steady  evaporation,  and  apart  from  the  influence  of  density 
changes,  the  local  Reynolds  number  Re  =  au/v  of  the  gas-flow  varies 

O 

in  proportion  to  am/a(x).  For  most  plausible  values  of  the  gas- 

flow  therefore  remains  laminar  even  in  a  throat,  and  the  pressure  drop 
can  again  be  estimated  from  Poiseuille's  formula, 

3  2  3 

a  dp/dx  =  -  8u  m/(7ia  p  )  =  -  8u  a  u  /a  , 
a  g  a  m  m 

_7 

where  8uga(num  is  independent  of  meniscus  size  and  typically,  «  5  x  10  gr 

-7  2 

when  u  =  2  x  10“  gr  sec/cm  . 
a 

_2 

If  now  a  =10  cm,  to  fiu  the  ideas,  then  a  /a.  =10  yields  a 
m  m  t 

value  of  5  x  llj  atm  for  |a  dp/dx |  at  the  throat,  and  the  pressure 
drop  is  insignificant.  If  affl/at  =  ICO,  however,  then  tho  estimate  suggests 

a  value  of  5  atm  for  |a  dp/dx J  at  the  throat,  and  not  only  the  estimate, 
but  clearly  also,  most  of  the  premises  and  csBumptions  of  this  Note, 
collapse.  If  the  evaporation  estimates  remained  valid  when  the  gas 
must  pass  through  very  small  throats,  they  would  imply  major  gasdynamical 
effects  in  such  throats,  the  work  expended  on  them  would  play  a  major 
role  in  the  thermodynamic  balances,  and  the  meniscus  temperature 


could  not  be  expected  to  be  close  to  the  initial,  ambient  temperature 
Tg;  the  physics  of  evaporation  would  be  quite  different  from  that  here 


described.  Accordingly,  the  microscopic  physics  of  evaporation  may 
depend  rather  drastically  on  whether  a  porous  medium  is  formed  into 
a  sheet-like  or  ball-like  aggregate... 


The  great  sensitivity  of  quantitative  estimates  to  void-passage 
size  suggests  that  a  profitable  discussion  of  fluid  mechanics  in  porous 
media  may  need  to  relate  to  quite  specific  circumstances.  In  particular, 
if  thought  returned  to  fabrics,  it  appears  unlikely  that  the  same  micro¬ 
scopic  physics  could  describe  evaporation  from  wool,  gore-tex  or  pile. 
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A£.£2EA££*  A  numerical  simulation  is  presented  for 
investigation  of  the  early  phase  of  the  flow  interaction  between 
a  water  jet  and  a  chemical  contaminant  residing  in  cavities  of  a 
wall  and  in  corners  of  two  perpendicular  walls.  Such  a 
interaction  often  occurs  in  surface  decontamination  processes. 
The  flow  model  for  this  analysis  is  a  two-dimensional,  two-fluid 
flow  govorned  by  the  unsteady  Navier-Stokes  equations.  The 
equations  were  solved  via  finite  difference  schemes  using  the 
S0LA-V0F  code.  Computer  plots  of  the  flow  development  are 
presented.  The  results  show  that  an  inclined  jet  is  more 
effective  than  a  normal  Jet  for  decontaminating  these  confined 
geometries.  In  all  flow  cases  studied,  the  impact  pressure  on 
the  impingement  wrll  far  exceeds  the  corresponding  steady-state 
dynamic  pressure  of  the  jet. 

JL  INTRODUCTION.  Utilization  of  liquid  jet  spray  is  one  of 
the  most  practical  and  most  effective  means  for  decontaminating 
Army  vehicles  in  chemical  warefare  or  for  surface  cleaning  in  the 
industry.  The  procedure  is  to  use  the  force  produced  by  the 
turning  of  jet  stream  at  the  impingement  to  displace  the 
contaminant . 

For  a  plane  wall  decontamination  using  a  water  jet  spray, 
Chang  [ll  has  characterized  the  interaction  of  the  jet  with  the 
contaminant.  In  many  areas  there  exist  cavities  under  a  surface 
or  corners  formed  by  two  perpendicular  walls,  as  depicted  in  Fig. 
1.  The  chemical  contaminant  under  consideration  may  reside  in 
these  confined  geometries  in  the  form  of  a  drop  or  a  layer  of 
fluid  covering  the  entire  bottom  surface  of  the  geometries. 

The  flow  interaction  involves  two  fluids,  the  jet  fluid 
(water)  and  the  contaminant,  and  there  is  an  interface  in 
between,  presenting  a  complex  two-fluid  problem.  To  simplify  the 
analysis,  we  treated  the  interaction  as  a  two-dimensional  flow. 
The  flow  field  is  governed  by  the  unsteady  Navier-Stokes 
equations  which  were  solved  numerically  via  finite  difference 
schemes  using  the  S0LA-V0F  computer  code  [2]  . 

The  emphases  of  this  study  are  on  the  early  evolution  of  the 
contaminant  drop  and  the  magnitude  of  the  impact  pressure  on  the 
bottom  surfaces  of  the  confined  geometries.  Computer  plots  of 
the  flow  process  are  presented.  The  effect  of  the  angle  of  jet 
incidence  on  the  flow  is  discussed.  The  results  obtained  provide 
useful  data  for  the  design  of  efficient  jet  sprays  used  for 
chemical  decontamination. 

II*  ElflS  M£££I£  A££  SJQI£MI2!S  £SSAIIJ2M*  Figs.  2  and  3  are 
the  models  of  the  pre-impingement  configurations  corresponding  to 
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the  schematics  presented  in  Fig.  1.  The  shaded  areas  are  the 
regions  initially  covered  by  the  contaminant  and  the  rest  of  the 
space  in  the  geometries  is  filled  with  water.  The  dimensions  of 
the  contaminant  drops  are  3  mm  by  0.6  mm,  representing  the 
average  size  of  the  drops  deployed  on  vehicles.  The  dimensions 
of  the  cavity,  however,  are  representive.  A  single  water  jet 
with  a  steady  and  uniform  velocity  is  directed  to  impinge  on  the 
upper  surface  of  the  water  at  an  angle  of  incidence,  3.  Two 
angles,  0  =  iJ5°  and  0=90°,  have  been  considered.  The  jet  width  D. 
is  1.83  mm  with  which  a  jet  o*ui  perform  decontamination 
effectively  and  efficiently  [1.].  In  Figs.  2  and  3  there  is  a  thin 
water  layer  covering  the  contaminant.  This  layer  may  exist 
practically  and  was  found  .ielpful  in  reducing  the  numerical 
instability  problem  encountered  at  the  water-contaminant 
interface.  Without  the  layer  the  stationary  and  highly  viscous 
contaminant  will  be  in  direct  contact  with  the  high-speed  jet 
fluid  and,  as  a  consequence,  there  is  a  great  shear  stress  at  the 
interface. 

Fig.  4  shows  the  flow  region  and  its  necessary  boundary 
conditions  for  the  flow  analysis  in  a  cavity.  An  outflow 
condition  is  specified  at  the  upper  boundary  so  that  the  fluids 
can  flow  out  the  region  after  the  start  of  the  jet  flow.  The 
setup  for  the  flow  analysis  in  a  corner  is  essentially  the  same 
exceDt  that  an  additional  outflow  condition  is  prescribed  at  the 
left  wall. 

The  governing  equations  of  the  above  flow  models  are  the 
continuity  equation 
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and  the  momentum  equations 
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where  t  is  time,  u  and  v  are  the  x-oomponent  and  the  y-component 
of  the  flow  velocity,  respectively.  The  density  p  ,  the  sound 
speed  c,  and  the  kinematic  viscosity  0  are  assumed  to  be 
constant.  Based  on  the  jet  width  and  the  jet  velocities  used  in 
this  study,  the  Reynolds  numbers  are  between  20  and  2000.  Within 
this  range,  Eqs.  (2)  and  (3)  are  felt  to  be  appropriate  without 
considering  turbulence  effects. 
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For  the  tracking  of  the  water-contaminant  interface,  we 
define  a  function  F,  called  the  fractional  volume  of  fluid 
function,  satisfying  the  relation 
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The  value  of  F  in  a  computational  mesh  cell  is  equal  to  the 
fractional  volume  of  the  cell  occupied  by  the  contaminant.  Then 
the  value  of  F  is  one  in  cells  full  of  the  contaminant  and  zero 
in  cells  containing  only  water.  Cells  with  F  values  between  zero 
and  one  contain  an  interface,  as  illustrated  in  Fig.  5. 

In  order  tc  adapt  the  3CLA-V0F  code  to  the  present  problem 
involving  two  fluid  with  distinct  viscosities,  we  use  the 
following  viscosity  relation 
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where  i)  is  the  average  kinematic  viscosity  of  the  fluid  mixture 
in  a  cell.  and  are  the  kinematic  viscosities  of  the 
contaminant  and  water,  respectively.  Since  the  densities  of  the 
two  fluids  are  different,  the  density  of  the  fluid  mixture  in  a 
cell  is  approximated  as 
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where  /£  and  ft  are  the  densities  of  the  contaminant  and  water, 
respectively. 

III*.  COMPOTATIOMAL  £££JjjJ&.  The  jet  velocities  chosen 
for  computations  are  5  and  10  m/s  (producing  steaay-state 
dynamic  pressures  of  1 2 .<4  kPa  and  *55  kPa,  respectively)  which  are 
practical  for  chemical  decontamination.  The  viscosity  of  the 
contaminant  under  consideration  is  i)e=10  -j)w  and  the  density  is 
fo=1.07/V 

The  computational  results  to  be  presented  are  in  two  parts: 
flow  patterns  and  the  impact  pressure  on  the  bottom  walls  of  the 
confined  geometries. 
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Fig.  6  shows  the  flow  generated  by  a  .  er  Jet  impinging  on 
the  upper  surface  of  a  cavity  filled  with  water  and  with  a 
contaminant  drop  initially  located  at  the  right  corner,  as  seen 
in  Fig.  2a.  It  is  noted  that  the  computer  plots  have  been 
magnified  three  times  in  the  vertical  direction  in  order  to 
provide  a  clear  flow  visualization.  The  plots  in  the  left  and 
the  right  columns  correspond  to  the  $5°~  and  90°-impingement, 
respectively.  The  flow  direction  of  the  main  stream  in  the 
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cavity  is  strongly  influenced  by  the  angle  of  jet  incidence.  In 
the  450-impingement,  the  main  stream  moves  toward  the  left  wall 
and  then  turns  and  exits  the  cavity,  while  in  the  90°-impingement 
the  main  stream  exits  the  cavity  adjacent  to  the  entrance  of  the 
jet  stream.  As  a  result,  the  contaminant  subjected  to  the  45°- 
impingement  has  experienced  a  larger  displacement  along  the 
bottom  wall  than  the  contaminant  subjected  to  the  9  G  °  - 
impingement.  In  addition,  there  is  still  a  small  amount  of 
contaminant  stagnating  at  the  right  corner  at  0.3  ms  in  the  90° 
case.  It  is,  therefore,  obvious  that  a  jet  impinging  at  45°  has 
more  cleaning  power  for  decontaminating  the  cavity.  Fig.  7»  which 
was  obtained  by  using  the  technique  of  embedding  marker  particles 
in  the  region  initially  covered  by  the  contaminant,  shows  another 
view  of  the  evolution  of  the  contaminant  drop. 

In  the  case  that  a  contaminant  drop  is  initially  located  at 
*he  left  corner  of  a  cavity  a3  depicted  in  Fig„  2b,  Figs.  8  and  9 
also  indicate  that  the  jet  impinging  at  43°  is  more  effective  for 
cleaning  the  cavity.  There  is  an  interesting  flew  phenomenon  to 
be  noticed  in  the  45°  case.  The  upward  flew  velocity  near  the 
left  wall  is  greater  than  the  velocity  slightly  far  away  the 
wall.  It  is  attributed  to  the  flow  impingement  on  that  wall. 

Fig.  10  shows  the  results  corresponding  to  Fig.  2c  in  which 
an  impingement  takes  place  in  the  central  part  of  the  cavity.  In 
the  90°-impingement,  the  main  water  stream  induced  by  the  jet 
flow  does  not  reach  the  end  walls.  Therefore,  there  is  little 
movement  in  the  contaminant  along  the  right  end  wall. 

Fig.  11  displays  the  flow  development  corresponding  to  the 
configuration  in  Fig.  2d  in  which  the  cavity  Is  full  of 
contaminant  with  a  thin  water  layer  on  the  top.  At  times  up  to 
0.27  ms,  the  flow  patterns  for  the  two  angles  of  jet  Incidence 
are  similar.  However,  it  the  90°  case  the  movement  of  the 
contaminant  along  the  right  end  wall  slows  down  as  time 
progresses  due  to  a  faster  development  of  viscous  layer  along  the 
wall. 


Next,  we  examine  the  results  for  the  other  kind  of  confined 
geometry  investigated:  the  corners  shown  in  Fig.  3.  Figs.  12  and 
13  present  the  flow  patterns  developed  from  Fig.  3a  and  Fig.  3b, 
respectively.  The  main  stream  is  moving  freely  to  the  left 
because  the  left  end  is  open  to  the  flow.  Though  the  flow 
patterns  for  the  two  angles  of  jet  incidence  are  similar  in 
general,  the  45°  jet  exhibits  slightly  better  performance  since 
it  cleans  up  the  contaminant  in  the  right  corner  faster. 

JEraasura  lisirlfcaiiau  aa  jj&si  Baiisa  Mall 


The  impact  pressure  «n  the  bottom  surfaces  of  the  subject 
confined  geometries  is  another  important  datum  to  be  determined. 
In  some  critial  areas  of  a  vehicle,  such  as  optical  windows,  the 
pressure  applied  without  causing  damage  is  limited  to  a  certain 
level.  Figs.  14  through  17  present  the  result!  •»  ratios  of  the 
instantaneous  impact  pressure  to  the  corresponding  steady-state 


716 


dynamic  pressure  which  is  1/2(  V . 2 )  for  various  flow 
configurations.  Fig.  is  the  result  for  the  configuration  in 
Fig.  2a,  showing  that  the  pressure  ratio  can  reach  13.  In  the 
configuration  in  Fig.  2c,  the  pressure  ratio  shown  in  Fig.  15  is 
even  higher,  approximately  19«  In  the  corner  flows  shown  in 
Figs.  3a  and  3b,  the  corresponding  pressure  ratios  displayed  in 
Figs.  16  and  1 7 »  respectively,  are  relatively  lower.  It  is  a 
result  of  the  change  of  boundary  condition  from  a  noflow  to  an 
outflow  condition.  We  also  notice  that  the  4  5  °- imp ingem  ent 
produces  a  slightly  higher  impact  pressure  in  Figs.  1*4  and  15 
than  the  90°-impingement  does. 

IV.  Computer  plots  have  been  generated  to 

show  the  detailed  flow  development  in  the  early  phase  of  the  jet- 
contaminant  interactions  in  cavities  and  corners.  Based  on  the 
flow  patterns,  a  jet  impinging  at  an  appropriate  inclined  angle, 
say  45°,  is  more  effective  than  a  normal  jet  for  decontaminating 
such  confined  geometries.  The  instantaneous  pressure  rise  on  the 
bottom  surfaces  of  the  confined  geometries  can  far  exceed  the 
corresponding  steady-state  dynamic  pressure  of  the  jet.  In 
general,  the  pressure  rise  is  higher  in  cavities  than  in  corners. 
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Fig.  10  Flow  Development  (corresponding  to  Fig.  2c) 
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ABSTRACT 

The  accurate  firing  of  unguided  projectiles  (bullets  or  rockets)  from  aircraft  leads  to  a  requirement 
for  rapid  computation  of  the  launch  vector  needed  to  assure  the  projectiles  striking  a  given  target.  The 
compulation  of  this  laying  vector  and  fuze  time  is  the  function  of  the  on-board  fire  control  system.  The 
fire  control  system  includes  sensors  which  measure  target  range  and  velocity,  aircraft  attitude,  position 
and  velocity,  and  atmospheric  conditions.  These  measurements  are  fed  to  an  on-board  fire  control 
computer  which  in  real  time,  typically  at  50  Hz,  must  compute  anew  the  laying  vector  appropriate  for  the 
rapidly  varying  variables  which  influence  the  ballistic  trajectory.  Six-degree-of-frer  'om  models,  which  are 
normally  used  in  laboratory  ballistic  modeling  3nd  simulation,  are  computationally  too  slow  and  otherwise 
cumbersome  to  be  implemented  for  real  time  fire  control.  A  methodology  for  developing  an  alternative- 
simplified,  yet  very  accurate,  model  is  described  in  detail. 

1.  INTRODUCTION 

The  fundamental  objective  of  fire  control  ballistics  is  the  provision  of  a  means  for  aiming  and  fuze 
setting  of  a  rocket  or  projectile  so  as  to  best  assure  placement  of  accurate  and  effective  fire  on  a  selected 
target.  This  objective  was  accomplished  historically  by  the  publication,  and  use  in  the  field,  of  the 
classical  "firing  table",  a  book  of  cumbers  tabulated  so  as  to  provide  an  easy  means  for  determining 
azimuth  and  elevation  settings  for  guns  and  rockets.  As  computer  technology  has  evolved,  the  firing  table 
bas  een  relegated  to  manual  backup  and  the  field  computer  computes  gun  and  rocket,  settings  in  real 
time.  Modern  weapon  systems  such  as  computer  controlled  air  defense  guns,  and  helicopters  and  tanks 
which  fire  on  the  move  encounter  a  fire  control  problem  characterized  by  very  rapidly  changing  variables 
which  drive  the  ballistics.  As  a  result,  a  wholly  tew  level  of  difficulty  and  sophistication  is  placed  on  the 
problem  of  lire  control  ballistic  prediction. 

The  fundamental  "generic*  models  used  extensively  for  prediction  of  launch  and  exterior  ballistic 
performance  have  come  to  be  known  as  Six-Dcgree-of-Frrcdom  (SDF)  models.  Such  models  embody  the 
equations  of  motion  for  both  translational  and  rotational  displacement  of  a  projectile  or  rocket.  The 
development,  refinement,  maintenance  and  modifications  for  n»-v  technology,  -of  this  type  model,  has  long 
been  performed  in  Defense  Department  Laboratories,  and  in  particular,  Army  laboratories  such  as  BRL. 
See,  e.g.,  Lieskc  and  McCoy 1  aad  Barnett a.  The  SDF  mode’  is  a  natural  extension  of  the  threc-degree- 
cf-freedom  (TDF)  model,  long  a  mainstay  of  ballistic  fire  control  prediction.  This  latter  model  represents 
only  the  translational  aspect.'  of  projectile  motion  and  is  of  an  order  of  magnitude  less  difficult  in 
computational  labor.  However,  TDF  models  may,  in  some  circumstances,  be  insufficiently  accurate 
because  they  do  not  model  the  interaction  of  translational  motion  with  the  aerodynamic  effects  associated 
with  yaw  and  pitch  along  with  the  yaw  and  pitch  interaction  with  spin. 

^  R  F.  Lieike  »sd  R.  L.  Me  Coy,  *  Eqaitiooi  of  Motios  of  a  Rifid  Projectile,’  BRL  Report  No.  1241,  Much  1061. 

B  Biraeli,  'Trajectoiy  Eqsaiioi*  for  a  Su-Defree-of-Freedom-Missile  Utinj  *  Fixed-Pbae  Coordinate  Sretem,*  Technical  Report 
3381,  Picitiooy  Arsenal,  Dover,  New  Jersey,  Jane  1800. 


Artillery  projectiles,  like  the  classics)  spinning  top  in  mechanics,  have  very  high  frequency  motion 
associated  with  spin,  precession  and  nutation.  A  fundamental  computational  requirement  of  numerical 
integration,  (the  technique  used  for  solving  these  models)  is  that  oscillatory  variables  need  to  be  sampled 
at  several  points  within  each  oscillation  to  maintain  accuracy  and  stability.  The  result  of  this  is  that  small 
steps  in  time  are  required  in  the  forward  marching  process  of  integration.  Accordingly,  complete  SDF 
calculations  for  spinning  artillery  or  automatic  cannon  projectiles  is  time  consuming,  relatively  expensive, 
and  is  done  sparingly  on  laboratory  computers.  An  alternative  to  the  SDF  and  TDF  models  was  developed 
by  Lietke  and  Reiter  *  which  has  been  found  extremely  useful  for  firing  table  computation  and 
implementation  in  some  ground  based  fire  control  computers.  Their  successful  approach  was  to  develop  a 
modified  TDF  model,  MTDF,  which  incorporated  an  explicit  estimate  of  the  "yaw  of  repose"  and  Us 
effects  into  the  translation  equations  without  having  to  integrate  the  high  frequency  motion  of  the  full 
SDF  models.  This  model  has  resulted,  in  greatly  reducing  the  computational  burden  referred  to  above 
while  incurring  only  a  slight  loss  in  accuracy,  it  is  now  a  standard  model  in  the  repertoire  of  ballistic 
mathematical  tools  widely  used  in  the  United  States  and  the  NATO  defense  community. 

Helicopter  fire  control  requires  the  use  of  ballistic  models  as  described  above.  The  SDF  model  is 
needed  for  rockets  and  the  MTDF  (.md  SDF  )  model  are  needed  for  automatic  cannon.  Unfortunately,  for 
reasons  detailed  herein,  these  models,  can  only  serve  an  intermediate  role,  albeit  an  important  one,  in  the 
process  of  developing  an  on  board-real  time  ballistic  prediction  model.  A  fundamental  problem  in  attack 
aircraft  fire  control  is  that  of  maintaining  the  "timeliness”  of  the  ballistic  solution,  in  a  turning,  climbing 
maneuver,  the  aircraft  velocity  components,  the  geometrical  relations  between  target  and  cannon  /rocket 
and  other  variables-  all  change  rapidly.  Note  also  that  these  models  ready  solve  the  "inverse"  of  the  fire 
control  problem.  In  fire  control  one  specifies  terminal  conditions  such  as  target  location  with  respect  to 
the  launch  platform.  The  solution  desired  consists  of  the  departure  attitude  needed  to  strike  the  target 
and  the  time  of  Bight.  Atcordingly,  one  would  need  to  solve  the  problem  iteratively,  i.e.,  guess  at  a  trial 
solution,  and  continually  readjust  the  departure  angles  until  the  desired  terminal  conditions  are  satisfied. 
Meanwhile,  if  the  aircraft  is  in  a  maneuver,  the  problem  has  changed  because  the  variables  driving  the 
ballistics  have  changed  6ince  the  iterations  were  initiated.  The  consequence  of  this  is  that  a  ballistic 
solution  that  is  not  computed  instantafleocly,  (or  nearly  so),  is  old  and  obsolete  before  the  munition  can 
be  fired.  Despite  the  recent  and  continuing  revolution  in  computer  technology,  the  embedded  computers 
in  aircraft  fire  control  systems  are  small  in  memory  capacity,  and  are  not  fast  enough  to  iteratively  solve 
the  models  described  above  at  tue  required  frequency. 

Modern  attack  helicopters  such  as  the  Cobra  and  Apache  are  armed  with  an  automatic  cannon  and 
a  family  of  2.75  inch  or  Hydra  70  rockets.  By  the  Sip  of  a  switch,  the  pilot  can  select  the  munition  be 
wishes  to  fire.  Accordingly,  the  modoi(s)  embedded  in  the  on  board  fire  control  computer  must  be  able  to 
key  on  this  switching  process  and  compute  the  solution  for  the  munition  selected.  In  fact,  it  is  possible, 
and  sometimes  dcsi.sble,  to  be  able  io  fire  both  the  gun  and  rockets  simultaneously.  The  requirement  for 
representing  Ms  or  more  rocket  types,  each  having  differing  weights,  measures,  aerodynamics,  staging  and 
fuzing  characteristics-  all  add  to  the  need  for  developing  a  common  general  model.  Furthermore,  Army 
aircraft  such  as  the  Cobra  and  Apache  have  differing  modes  of  articulating  rocket  pods  It  is  highly 
desirable  that  the  rocket  ballistics  be  developed  independently  of  the  method  of  pod  articulation.  See 
Appendix  F.  The  logical  strategy  evolves  for  developing  a  procedure'  which  makes  the  general  model 
applicable  to  a  specific  munition  by  selective  retrieval  of  a  pool  of  constants  (based  on  switch  position). 
Some  constants  of  the  model,  however,  may  differ  from  one  aircraft  type  io  another.  Sec  Appendix  E.  This 
necessity  for  providing  a  capability  for  many  munitions  unds  to  reduce  the  storage  capacity  available  for 
the  collection  oi  instructions  related  to  the  model  itself.  The  requirement  for  an  accurate  ballistic 
solution,  for  such  a  family  of  munitions,  varying  types  of  aircraft,  air  defense  guns,  tanks,  or  other  moving 
gun  platforms,  that  caa  be  computed  cyclically,  b  real  time,  during  an  engagement,  leads  to  the  need 
for  the  methodology  described  herein. 

^  R.  F.  Litjfct  sad  M  L.  RtHtr,  ’Eqtatiou  of  Mottos  for  a  ModiStd  Pout  Mast,*  Ballistic  Restart  k  Laboratories  Report  No. 
MH,  Mtrck  IMS. 


2.  GENERAL  DESCRIPTION  OF  THE  MATHEMATICAL  METHODOLOGY 


A  perspective  for  defining  the  mathematic*]  problem  can  be  obtained  by  examination  of  Figure  (2- 
!).  Therein  an  attack  helicopter  in  flight  is  depicted  engaging  a  ground  target.  While  the  scenario 
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Figure  (2-1).  Typical  attack  helicopter  scenario  for  rocket  firing. 


depicts  the  use  of  a  I)ydr*-?0  rocket,  the  employment  o?  an  aufcouutie  caancn  would  be  similar.  Tbe 
telescopic  sight  unit,  (TSU),  is  continually  maintaining  the  line  of  sight,  (LOS),  which  can  be  viewed  as  a 
vector  '  jcnecting  a  reference  point  sb  the  airersft  to  a  selected  point  on  tbe  target.  On  board  sensors 
continually  measure  the  vector  velocity  and  attitude  orientation  of  the  aircraft  2long  with  the  LOS  range, 
x,  ,  between  target  and  aircraft.  For  moving  target.,  sensors  coupled  with  mathematical  "filters* 
continually  estimate  the  vector  velocity  of  the  target.  Target  motion,  along  with  other  considerations, 
determine  components  xn  and  x(  which  along  with  s,  define  when  the  projectile  should  be  in  "one  time  of 
flight*.  Environmental  factors  such  as  wind,  air  density  and  temperature  are  also  provided  by  sensors  on 
the  aircraft.  Toe  temperature  of  the  munition  is  also  k^owa,  either  by  use  of  a  magazine  thermometer  or 
by  assuming  tbat  the  munition  has  the  same  temperatuie  as  the  environment.  Required  information  on 
downwash  <!ik*  to  tbe  rotation  of  the  helicopter  blades  is  obtained  by  mews  described  in  Appendix  D. 
Component-  of  the  gravity  vector  are  made  available  by  reference  to  orientation  of  coordinate  axes  that 
ate  aligned  with  the  local  gravity  vector. 

Tbe  above  described  information  is  fed,  at  typically  50  Hs,  to  the  Fire  Control  Computer,  (FCC). 
The  primary  function  of  the  FCC  is  to  compute  the  angular  settings  (the  attitude  of  departure)  and  for 


M>me  unities.*,,  iuzt  time,  which  best  assures  accurate  placement  of  fire  on  the  target.  The  objective  is 
thus  wen  to  be  tht  development  of  »  collection  of  formulae,  to  implement  in  the  FCC,  which 
accomplishes  this  task.  Toward  this  end  it  is  useful  to  review  two  procedures,  used  previously,  that  can 
serve  as  building  blocks  toward  development  of  a  more  generalized  procedure. 

Global  Fitting  Approach 


A  perspective  on  the  global  fitting  approach*  can  be  obtained  from  the  work  of  Chandler,  Baker  and 
Din  jar  at  the  US  Army  Redstone  Arsenal  and  C.  MasaUis  and  H.  Breaux  at  the  Ballistic  Research 
Laboratory.  That  work  is  reviewed  (and  references  listed)  by  Breaux  4.  The  objective  of  that  work  was  to 
find  an  alternative  to  the  SDF  models  for  use  in  fire  control  with  ground  based  missiles  such  13  the 
Redstone,  Jupiter,  Pershiug  and  Lance.  For  these  stationaiy-at-launch,  ground  systems,  certain 
complexities  unique  to  the  moving  platform  are  not  present  and  the  computational  speed  factor  is  not  as 
critical.  A  computational  cycle  time  of  seconds,  or  even  tens  of  seconds,  is  tolerable.  However,  at  the 
longer  ranges  of  these  systems,  other  complexities  enter  such  as  those  associated  with  earth  rotation  and 
curvature.  Nevertheless,  »  basic  procedure  employed  in  that  work  remains  a*  the  cornerstone  of  current 
methodology.  That  concept  is  depicted  in  Figure  (2-2)  below  and  consists  of  approximating  one  mode! 


Figure  (2*2).  Triangle  of  Approximations  for  Development  of  Fire  Control  Ballistics. 

*  Rtfeitsct  to  sspalmiird  work  is  BRL  related  U  stuck  ktlicopttn  i*  <Red  is  tit  Acks«wftd(mtst. 

*  M  Bet  ms,  *Tlt  CotnpiUUos  of  Fitiif  Tsbks  for  Glided  Mbiilet*  BRL  Report  No.  1341,  November  IMS. 


with  another  model.  First  a  "truth*  mode!  must  be  developed-both  as  a  means  for  assessing  actual 
ballistic  performance  and  then  to  make  possible  the  development  and  accuracy  assessment  related  to  the 
Are  control  model.  This  model  must  represent  all  the  munitions  of  interest—  fired  with  launch  conditions 
representing  the  r'rcraft  environment.  Speed  of  computation  is  not  a  significant  factor  here  and  this 
model  is  normally  mplemented  in  a  laboratory  computer. 

The  global  fitting  approach  consists  of  basically  five  phases. 

(A)  Development  of  the  SDF  Model. 

(B)  Specification  of  a  candidate  fire  control  equation  (FCE) 
mode!  to  be  fitted. 

(C)  Development  of  a  data  base  of  trajectory  calculations  which 
span  the  expected  range  of  all  variables  (and  mixtures) 

in  the  planned  deployment  scenarios. 

ID)  Fitting  of  the  model  to  obtain  values  for  all  coefficients. 

(E)  Computer  validation  of  the  model. 

Phase  (A)  generally  consists  of  obtaining  all  weights  and  measures,  physical  characteristics,  and 
aerodynamic  functions  that  are  appropriate  to  the  munitions)  of  interest  and  processing  the  data  irto  a 
form  acceptable  to  the  SDF  program.  Special  features  of  the  munition  ballistics  or  lauuch  dynamics  may 
require  modification  of  the  SDF  program.  This  is  a  very  critical  phase  in  that  everything  done  hereafter 
binges  on  the  adequacy  and  accuracy  of  this  model.  Phase  (B)  is  the  most  difficult  aspect  of  th*  problem 
and  is  the  core  of  the  methodology  to  which  a  large  part  of  this  paper  is  devoted.  Phase  (C)  consists  of 
designing  a  matrix  of  conditions  and  use  of  the  SDF  program  to  compute  the  corresponding  trajectories. 
The  fitting  process.  Phase  (D),  is  itself  a  part  of  the  methodology.  However,  the  procedure  designed  by 
Brteuz  *,  will  be  used  herein  to  form  a  hybridized  methodology  which  will  be  described.  Phase  (E)  is 
designed  to  test  the  execution  of  the  model  in  a  form  similar  to  it’s  field  employment  and  simultaneously 
compare  its  predicted  results  with  the  SDF'  "truth  model”. 

Phase  (E)  should  not  be  confused  with  field  validation.  In  field  validation  the  results  of  the  effort, 
namely  the  ballistic  FCE’s,  are  pr.  jammed  into  the  Fire  Control  Computer  and  actual  live  tests 
conducted.  Poor  performance  in  this  phase  can  frequently  occur  due  to  improper  programming  of  the 
FCE’s,  poor  sensor  performance,  or  a  poorly  designed  or  specified  RDF  model.  If  the  latter  is  found  to  be 
true,  the  SDF  model  must  be  fixed  and  the  process  described  above  must  be  repealed' . 

Closed  Form  Solution  Approach 

A  solution  approach  has  been  employed  by  Norwood  \  for  airborne  gunfire,  that  makes  use  of  the 
"Method  of  Siacci”.  In  this  approach,  the  computational  problem  is  reduced  to  a  collection  of  integrals. 
Norwood's  approach  was  made  more  practical  by  Bcnohreti*  *  who  obtained  closed  form  expressions  for 
the  Siacci  integrals  by  approximating  the  projectile  drag  by  three  connected  fine  segments.  This  approach 
has  the  benefit  of  "permanence"  in  that  the  approximate  solution  is  there  for  all  time-for  all  projectiles 
that  satisfy  the  fiat  fire  assumption  and  which  can  be  adequately  approximated  by  a  TDF  model.  The 
drag  curve  must  also  permit  adequate  approximation  by  the  three  line  segments.  The  disadvantage  is 
that  this  technique  does  not  incorporate  ballistic  effects  that  arise  from  factors  that  only  the  SDF  model 

i~ . — . -.-i— 

Nov  that  tie  methodology  sad  related  software  la*  beea  developed,  reproceuing  for  a  reviaed  or  aew  SDF  data  bate  cat  be  dote 
roitiaely  aid  very  qiickty. 

6  Norwood,  Job*  M ,  *A  Review  et  tbe  Method  of  Siacci  Trajectory  Compotatiea  for  Airborae  Gaatre*  ARL-TM-7I-27,  Applied 
Research  Laboratories,  Tbe  Uahrenhy  of  Texas  at  Aoitia,  Aaitis,  Texaa,  Avgait  1971. 

*  Beeokraitii,  Vitalii*,  ‘A  Cloetd  Form  Siacci  Fvactioa  Method  for  Trajectory  Calcelatioa*,*  Journal  of  Ballistic*,  V.  S,  No.l, 
1832,  pp.  1325-1347. 
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could  predict.  Ballistic  effects  due  to  aerodynamic  "drift”  and  "jump”  and  secondary  drag  effects  related 
to  initial  yaw  and  yaw  rate  must  be  added  to  the  model  by  extraneous  methods  which  require  essentially 
the  same  effort  as  the  global  fitting  approach.  This  method  can  represent  the  free  flight,  phase  of  a  rocket 
but  not  the  boost  phase.  Extension  of  such  a  model  to  include  more  complex  ballistics  generally  leads  to 
the  need  for  some  form  of  fitting  and  in  turn  necessitates  all  the  stages  of  the  global  fitting  approach. 

The  methodology  developed  herein  can  be  viewed  as  a  hybrid  method  which  combines  features  of 
both  the  global  fitting  technique  and  the  closed  form  approximation.  It  can  also  be  viewed  as  a 
generalization  and  extension  of  the  earlier  work  done  by  the  author*.  The  key  aspect  of  the  work 
described  therein  was  the  use  of  step-wise  regression  (least  squares)  to  facilitate  tb  Process  of  model 
building.  This  process  can  be  be  exhibited  by  simple  example.  Assume  that  a  process,  whose  outcome  is 
represented  by  w,  is  dependent  on  the  variables  x,y,».  Assume  that  a  very  accurate  model  exists  that 
permits  the  specification  of  x,y  and  z  and  then  computes  w.  However,  the  computation  is  so  expensive 
and  time  consuming  that  it  can  only  be  done  sparingly  and  at  facilities  remote  from  where  the  need  exists. 
This  suggests  the  need  and  possibility  of  developing  an  approximating  model  which  is  easier  to  evaluate. 
In  attempting  such  an  effort  consider  the  following  two  approaches  to  global  fitting. 

The  Empirical  Approach  to  Global  Fitting 

Here  no  specific  information  on  the  underlying  mathematical  model  is  assumed  known  other  than  the 
dependence  of  w  on  x,  y  and  s.  A  candidate  linear  model  is  specified  by  the  formula 

u;«d(+aiX+S2jr+J!tX+«4Xir+«sxx-t4irz 


+  «I*^+SiXi2+S9^|f  + 


•  •  • 


Numerous  types  of  linear  models  could  be  employed,  however,  a  polynomial  model  is  assumed  for 
illustration.  By  providing  a  data  base  of  w  versus  xj  and  s  for  an  appropriate  mix  and  range  of  the 
variables  and  the  above  model  to  a  stepwise  regression  procedure  as  described  by  Breaux  7’*,  or  Hocking  * 
one  might  arrive  at  a  suitable  model.  The  result  of  cuch  a  regression  procedure  is  the  sequential  listing  of 
submodels,  each  differing  from  the  previous  submodel,  by  one  term  taken  from  the  above  collection  of 
terms.  The  first  submodel  is  the  one  term  alone  which  best  approximates  w.  The  second  model  contains 
two  terms,  including  the  one  term  model  plus  a  second  term  which  when  added  to  the  first  provides  the 
best  approximation.  The  process  continues  in  this  fashion  but  simultaneously  seeks  to  weed  out  terms 
that  are  no  longer  useful  due  to  cross  correlations  between  clusters  of  variables  that  have  been  introduced 
into  the  model.  Variations  in  strategy  and  computational  sequencing  are  discussed  at  length  by  Hocking*. 
By  examining  the  program  output  which  includes  the  progression  of  the  variance  of  residuals  in  w,  the 
correlation  coefficient,  the  ”t"  values  on  the  regression  coefficients,  etc.,  and  perform  ir~  some 
experimentation  on  model  definition,  one  can  generally  build  a  suitable  model  by  this  process. 

The  Physical  Approach  to  Global  Fitting 

In  the  physical  approach  one  first  recognizes  that  the  problem  at  hand  embodies  a  field  of  knowledge  to 
include  a  collection  of  literature.  Such  a  literature  indicate*  that  the  process  leading  to  the  outcome  w 
has  a  "closed  form"  result,  dependent  on  x,y,  and  s,  if  certain  idealizing  assumptions  are  made.  One 
such  outcome  might  be 


'  Brtitx,  H.  J.,  *Om  SUpwi**  Milt:,1*  Lisf  w  .  j*mios ,"  URL  Report  No.  IMS,  Aqui  1*67. 

*  Btcaax,  H.  J..  *A  Modileatioa  of  EL  ■  itoi'i  Tcr.hsiqic  for  Stcpwbe  ftegrtnies  A aatyw  *  Ctaa.  of  tic  ACM,  1WI,  Vol  H, 
No. «. 

9  Hockiaf,  R.  R.,  *Tke  Aaalyiii  tad  Sdectio*  J  "tnabko  is  Liaetr  Rcgraiios,*  Bioactrica,  V.  S3,  Marck  IS76,  pp  1-48 
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u>  *»  A,  *  +  B,  exp[ a  x  y/z) .  (2-2) 

This  might  suggest  that  a  more  intelligently  defined  model  would  proceed  from  Eq.  (2-2)  as  a  base.  For 
this  simple  model  one  might  use  non-linear  least  squares  and  try  to  fit  A„  B,  and  a  as  free  parameter*. 
When  the  model  is  complex,  containing  many  variables  and  terms,  it  is  generally  better  to  linearize  the 
model  and  proceed  as  follows:  In  Eq.  (2-2),  for  example,  expand  non  linear  terms  in  a  Taylor  series 

iii=»a,+  ii|i+ijir/r+(ii(i  yfzf  +  o4  (*  yftf  +  •  •  •  {-’-:*) 

As  before,  this  candidate  model  and  the  data  base  is  processed  by  the  stepwise  regression  process.  The 
resulting  model,  developed  by  the  iatter  procedure,  will,  in  general,  be  superior  to  one  developed  by  the 
empirical  process.  Any  insight,  born  of  experience  and  knowledge  of  the  process,  helps  toward  better 
debiting  the  candidate  model,  used  as  a  point  of  departure,  when  thereafter  employing  a  lilting  process. 

The  fire  control  problem  addressed  herein  is  analogous  to  the  above  simple  example  with  one  very 
important  difference.  The  example  concerned  itself  with  one  process  and  one  approximating  model  was 
needed.  Hence,  either  of  the  two  approaches,  if  found  to  lead  to  an  adequate  model,  could  be  viewed  as 
successful.  The  attack  helicopter  Ore  control  problem,  by  contrast,  can  be  viewed  as  representing  ten  or 
more  related  processes.  If  the  problem  is  addressed  by  the  empirical  approach  described  above,  intuition 
and  experience  indicates  that  the  result  will  be  as  many  separate  models  as  their  are  rockct/munitinn 
lyyts  *. 

Tin*  approach  taken  will  be  one  of  examining  all  components  of  the  launch  and  (light  process,  the 
boost  and  free  flight  coupling,  and  practical  idealizations  which  provide  a  physical  basis  for  simplified 
dosed  form  solutions  leading  to  an  intelligent  definition  of  the  candidate  model.  These  components,  will 
he  pioccssed  by  the  stepwise  regression  procedure  by  use  of  a  data  base  of  calculations  from  an  Sl)F 
program  to  determine  their  adequacy.  The  approach  is  thus  seen  to  be  a  hybridized  one  which  includes 
obtaining  closed  form  approximations  and  then  employs  the  features  of  the  global  fitting  approach  as 
described  above. 


Note:  Due  to  its  length,  only  Sections  1.  and  2.  of  this  pr  er  have  been  included  in  the  Proceedings. 
Information  on  the  availability  of  the  complete  document  c?  btained  by  writing  to: 


Director 

USA  Ballistic  Research  Laboratory 
ATTN:  DRXBR-TSD 
Aberdeen  Proving  Ground,  Md.  21005 


*  Thu  aspect  v»»j  fouod  to  be  critically  important  it  tke  camatly  os-going  Fire  Coatrol  Systems  Integratioa  Proguoi  for  Ot.n 
The  ruisoitude  of  effort,  sad  resulting  time  , chaired,  to  provide  for  as  many  as  aiar  differeal  rocket  types,  forced  as  early  ’freeze’ 
ia  the  baric  strurtsie  of  the  model  long  before  work  was  completed.  The  baric  model  was  tkea  programmed,  validated  for  specific 
rocket  types,  and  thea  coefficieats  for  tke  remaiaiag  members  of  the  family  were  added  as  tkey  were  obtained.  Tkis  permitted 
timely  progress  on  tke  overall  program. 
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SINGULAR  VALUE  DECOMPOSITION  FOR  SOLUTION  OF* 
DIFFERENTIAL-ALGEBRAIC  EQUATIONS  OF  MECHANICAL  SYSTEM  DYNAMICS 

Neel  K.  Mani 
Edward  J.  Haug 

Center  for  Computer  Aided  Design 
College  of  Engineeri. 

University  of  Iowa 
Iowa  City,  Iowa'  52242 


ABSTRACT »  A  computer-based  method  for  solution  of  non-linear, 
constrained  differential-algebraic  equations  of  motion  of  mechanical 
systems  is  developed.  The  differential  equations  of  motion  and  non¬ 
linear  holonomic  constraint  equations  are  written  in  terms  of  a  maximal 
set  of  Cartesian  generalized  coordinates,  to  facilitate  the  formulation 
of  constraints  and  forcing  functions.  Singular  Value  Decomposition  of 
the  constraint  Jacobian  matrix  is  used  to  generate  a  coordinate 
transformation  that  defines  a  new  set  of  generalized  coordinates  that 
are  naturally  partitioned  into  independent  and  dependent  sets,  with 
several  desirable  properties.  This  information  is  used  to  construct  a 
reduced  system  of  independent  differential  equations  of  motion  that  can 
be  integrated  using  standard  numerical  integration  algorithms.  It  is 
also  shown  that  the  method  speeds  the  iterative  solution  of  dependent 
generalized  coordinates  from  constraint  equations.  A  physically 
reasonable  method  Is  presented  to  determine  when  the  choice  of 
independent  generalized  coordinates  needs  to  be  changed.  A  tracked 
vehicle  example  is  presented  to  Illustrate  the  method  and  Its  advantages 
over  other  methods  of  solution. 

1 .  INTRODUCTION .  The  availability  of  dynamic  analysis  codes  for 
high  speed  digital  computers  has  greatly  increased  the  dimension  of 
systems  that  can  now  be  analyzed.  The  most  convenient  method  of 
defining  the  kinematics  of  larp  scale  systems  is  in  terms  of  a  maximal 
set  of  Cartesian  generalized  coordinates,  which  must  satisfy  kinematic 
constraints.  Such  a  coordinate  system  may,  however,  not  be  best  suited 
for  solution  of  the  equations  of  motion.  This  paper  presents  a  singular 
value  decomposition  algorithm  that  transforms  the  Cartesian  generalized 
coordinates  into  a  system  of  coordinates  that  is  well  suited  for 
solution  of  the  equations  of  motion.  Such  an  approach  permits  easy  user 
representation  of  complex  mechanical  systems,  without  loss  of  accuracy 
during  solution  of  the  system  equations  of  motion. 

To  develop  and  illustrate  the  theory,  planar  dynamic  systems  are 
considered.  The  method  is  equally  applicable  to  spatial  dynamics..  A 
typical  rigid  body  (denoted  body  i)  in  the  plane  is  shown  in  Fig.  1, 
with  a  centroidal  body-fixed  coordinate  system  represented  by 

and  nj  axes.  The  X  and  Y  axes  are  an  inertial  Cartesian  coordinate 
system.  The  position  and  orientation  of  the  body  in  the  X-Y  plane  are 
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specified  by  giving  the  coordinates  x^  and  y^  of  the  body  centroid  and 
the  angle  that  the  axis  makes  with  the  global  X  axis.  Thus,  for 
body  i,  a  generalized  coordinate  vector  q1  is  defined  as 
q*  -  (x,  y j  <J>.  ] 1 .  The  corresponding  generalized  force  on  body  i  is 
i  i  i  i  T  i  i 

Q  ■  [Q  Q  Q  ]  ,  where  Q  and  Q  are  the  X-  and  Y-  components  of  force 
acting  at  origin  of  tfie  coordinate  system  and  Q*  is  the 

torque  acting  on  the  body.  Tne  complete  vector  of-generaliZed 

coordinates  for  the  system  is  given  by  q  *  [q  q  ...q  ],qC  Rn» 
1  T  oT  «I1»T  nt 


T  T 

and  Q  *  IQ1  Q2  .. 
system  and  n  =*  3NB. 


] ^ ,  where  NB  is  the  number  of  bodies  in  the 


Kinematic  constraints  between  bodies  in  the  system  can  be  written 
[1]  as 

T  T  T 

*  =  [*l  (t,q)  *2  (t,q)  ...  ^(t.q)]*  -  0  (1) 


where  there  are  m  constraint  equations.  To  ensure  that  the  constraints 

are  independent,  it  is  required  that  the  mxn  Jacobian 

matrix  l  =  [3*,/3q,]  be  of  full  row  rank, 
q  i 

The  kinetic  energy  of  the  system  can  be  written  as 


T  -  {  qTMq 


where  M  is  the  mass  matrix  of  the  system.  The  mass  matrix  of  body  i  can 
be  written  as  M1  «  Diag  [m*  J^],  where  m^  and  J,  are  the  mass  and 

moment  of  inertia  of  body  1.  The  system  mass  matrix  can  then  be  written 


M  -  DiagfM1  M2  ...  fP] 


Lagrange *8  equations  of  motion  for  a  constrained  system  with 
workless  constraints  are  [2,3] 


Mq  +  «>*A  -  Q 


where  X  is  a  vector  of  Lagrange  multipliers.  The  initial  conditions  at 
time  t  *  tp  are  specified  as 


•  -0  { 


738 


:  "ViTVi. 


JV  *-~L  •.’i  *.V-X-\. 


isrrcncir. 


where  the  initial  position  q®  and  velocity  q^  must  be  consistent  with 
system  constraints. 

Differentiating  the  constraint  equations  of  Eq.  1  with  respect  to 
time  yields  the  velocity  equation 

$  q  +  a  a  0  (6) 

q1  t 

Differentiating  Eq.  6  with  respect  to  time  yields  the  acceleration 


equation 

v =  ~  (yy  (7) 

Equation  4  is  combined  with  Eq.  7  to  give  a  system  of  matrix 
equations  for  accelerations  and  Lagrange  multipliers  that  can  be  written 
as 
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This  combined  system  of  equations  of  motion,  the  kinematic  constraints 
of  Eq.  1,  and  the  initial  conditions  of  Eq.  5  completely  determine 
dynamic  response  of  the  system.  Equations  8  and  1  represent  a  system  of 
mixed  Differential-Algebraic  equations  (DAE). 


2.  EXISTING  METHODS  FOR  SOLVING  SYSTEMS  OP 

MIXED  DIFFERENTIAL-ALGEBRAIC  EQUATIONS.  Systems  of  DAE  can  not 
normally  be  solved  by  couvention  numerical  methods  that  are  used  to 
solve  ordinary  differential  equations  [4].  Several  methods  have  been 
developed  to  solve  sets  of  DAE.  Gear  [5]  h as  proved  that  his  stiff 
Integration  algorithm  can  be  used  to  solve  certain  kinds  of  D/E. 

Certain  other  DAE  systems  can  be  converted  into  equivalent  forms  that 
are  solvable  by  Gear's  method.  There  exist,  however,  systems  of  DaE 
that  can  not  be  solved  by  this  method.  In  Refs.  6  and  8,  this  idea  has 
been  successfully  employed  in  analysis  and  optimal  design  of  mechanical 
and  electrical  networks. 

A  second  approach,  due  to  Baumgarte  [9],  uses  ideas  from  feedback 
control  theory  to  construct  a  modified  differential  equation  that 
implicitly  accounts  for  constraint  equations.  A  difficulty  in  this 
approach,  however,  is  the  selection  of  certain  factors  that  influence 
the  accuracy  of  solutions.  Proper  values  of  these  factors  are  not  known 
and  experts  are  in  disagreement  over  suitable  values.  Furthermore, 
there  is  no  way  to  impose  positive  error  control  on  constraint 
violations  for  this  method. 


739 


The  major  drawback  of  the  two  methods  described  so  far  is  that  all 
n  generalized  coordinates  must  be  integrated,  whereas  only  n-o  of  them 
are  independent.  Furthermore,  m  is  often  large  and  n-m  is  small.  Thus, 
it  would  be  attractive  to  integrate  for  only  n-m  independent  generalized 
coordinates.  Solution  of  the  remaining  m  dependent  generalized 
coordinates  from  constraint  equations  can  then  ensure  positive  error 
control  on  constraint  violations.  The  n-m  generalized  coordinates  that 
are  computed  by  integrating  the  equations  of  motion  are  called 
"independent  generalized  coordinates"  and  the  remaining  coordinates  are 
called  "dependent  generalized  coordinates".  Wehage  and  Haug  [1] 
developed  an  algorithm  to  pick  an  acceptable  choice  of  independent 
generalized  coordinates.  LU  decomposition  of  the  constraint  Jacobian 
matrix  is  used  to  identify  independent  and  dependent  coordinates.  This 
algorithm  performs  satisfactorily,  but  may  lead  to  poorly  conditioned 
matrices,  requiring  considerable  care  to  obtain  accurate  solutions.  It 
is  then  required  that  the  LU  factorization  be  repeated  and  a  new  set  of 
independent  generalized  coordinates  selected.  The  result  is  an  increase 
in  computer  time  and  greater  propagation  of  integration  error  than 
desired. 

A  broader  class  of  vectors  zl  of  independent  generalized 
coordinates  can  be  generated  from  the  vector  q  of  physical  generalized 
coordinates  by  the  transformation 


zl  =  VI  q 


(9) 


where  VI  is  an  (n-m)xn  transformation  matrix.  In  the  algorithm  of  Ref. 
1,  matrix  VI  is  a  Boolean  matrix.  Therefore,  only  individual 
generalized  coordinates  are  chosen  as  independent.  Use  of  a  Boolean 
transformation  matrix  to  pick  independent  generalized  coordinates  from  q 
is,  however,  restrictive.  It  is  desirable  to  enlarge  the  set  of 
possible  independent  generalized  coordinates  by  allowing  matrix  VI  to  be 
more  general.  Such  a  choice  of  VI  can  allow  linear  combinations  of 
physical  generalized  coordinates  to  be  selected  as  independent 
generalized  coordinates.  If  the  rows  of  VI  are  mutually  orthogoral,  the 
resulting  linear  combinations  of  individual  generalized  coordinates  will 
be  mutually  independent.  Specific  form3  of  such  a  transformation  can  be 
chosen,  based  on  properties  of  the  DAE  under  consideration  and  desired 
properties  of  the  transformed  coordinates. 


3.  USES  OF  SINGULAR  VALUE  DECOMPOSITION.  Singular  value 
decomposition  (SVD)  has  been  extensively  used  to  solve  multivariate 
regression  problems  that  may  have  multicoliinearity  between  independent 
variables  [10,11].  This  method,  known  as  Principal  Components  Analysis, 
defines  <.  new  set  regressors  as  linear  combinations  of  the  original 
independent  variables  to  most  closely  follow  the  pattern  made  by  data 
points.  The  new  regressors  also  account  for  dependence  between  the 
original  independent  variables.  SVD  has  also  been  used  for  solution  of 
Least  Squares  Problems  in  which  linear  combinations  of  the  unknowns  are 
used  as  a  new  set  of  unknowns  to  solve  the  problems  [12]. 


Therefore, 


T  n* 

where  0  is  the  diagonal  matrix  D  D.  Hence,  columns  of  VA  Irows  of  V) 
are  the  orthonormal  eigenvectors  of  the  symmetric  matrix  [14]. 

The  singular  values  of  a  matrix  are  very  stable  with  respect  to 
perturbations  of  its  elements  [12].  They  are  also  the  most  reliable 
indicators  of  the  conditioning  of  the  matrix  [14].  Gaussian 
elimination,  on  the  other  hand,  can  not  detect  ill-conditioning  or  rank 
degeneracies  due  to  small  errors  in  entries  of  matrices.  Consider,  for 
example,  two  matrices  ^  and  Bn  of  dimension  n  such  that 

bu  ‘ l!  au  ■ l- 1 '  *•  — • " 

bu  " 1;  “ij "  "*• 3  * 1  + 1 . .  1 ' 1 . ” 

by  -  0;  ay  -  o,  J  -  1 . 1  -  1;  i  -  2 . n 


These  upper  triangular  matrices  are  of  rank  n.  However,  matrix  An  +  En 
is  of  rank  n-1,  where  ER  is  an  n  dimensional  matrix  with  the  nth  element 
in  the  first  column  equal  to  -2  -n  and  all  other  elements  equal  to 
zero.  On  the  other  hand,  latrix  Bn  +  En  is  of  rank  n.  Therefore, 
diagonal  entries  of  an  vvcr  or  lower  triangular  matrix  do  not  indicate 
existence  of  ill-conditioning.  Since  smell  errois  enter  into  matrix 
entries  due  to  truncation  of  numbers  in  computer  representation,  rank 
degeneracy  of  a  matrix  is  a  serious  problem  in  numerical  computation. 

The  most  reliable  method  to  determine  the  rank  of  a  matrix  is  to  look  at 
its  smallest  singular  values.  For  matrix  A_  when  n  is  equal  to  25,  the 
two  smallest  singular  values  are  1.50033427/0  and  8.94x10  ,  H.t  for 
matrix  Bn  the  two  smallest  singular  values  are  0.503818' 889  and 
0.5009501377.  The  number  of  zero  singular  values  indicates  the  order  of 
rank  deficiency.  A  very  small  singular  value  indicates  ill-conditioning 
of  the  matrix  [12,14]. 

Even  though  x,he  singular  values  are  the  square  roots  of  eigenvalues 
of  the  matrix  *  *  ,  they  must  not  be  determined  this  way,  since  small 
eigenvalues  wil?  $e  reduced  to  rounding  error  proportions  and  be  lost  in 
the  computing  process.  A  singular  value  decomposition  algorithm  due  to 
Golub  and  Kahan  [14]  is  based  on  a  special  adaptation  of  the  QR 
algorithm.  The  singular  value  decomposition  is  computed  in  two 
stages.  In  the  first  stage,  matrix  #  is  transformed  into  a  bidiagonal 
matrix  B  bv  a  sequence  of  Householder^transformations.  The  second  stage 
of  computation  is  application  cf  a  specially  adapted  QR  algorithm  to 
compute  the  singular  value  decomposition  of  B.  The  first  part  of  this 
algorithm  is  direct  and  the  second  part  is  iterative.  This  algorithm  is 
available  in  the  form  of  computer  software  in  Refs.  15  to  17. 


5.  USE  OF  SINGULAR  VALUE  DECOMPOSITION  FOR  GENERALIZED 

COORDINATE  PARTITIONING.  One  may  define  a  new  variable  z,  such 


that 


This  is  an  orthogonal  transformation  of  coordinates  that  gives  a  new 
vector  z  of  generalized  coordinates  for  the  system.  Since  matrix  V  is 
orthogonal,  hence  non  singular,  there  exists  a  one-to-one  correspondence 
between  z  and  q,  locally.  From  now  on,  generalized  coordinates  q  will 
be  referred  to  as  physical  coordinates  and  generalized  coordinates  z 
will  be  called  composite  coordinates. 

Taking  the  first  time  derivative  of  Eq.  14,  with  the  transformation 
matrix  V  held  constant, 

z  -  Vq  (15) 

The  time  derivative  of  Eq.  15,  gives 


z  ®  Vq 


(16) 


Consider  a  perturbation  6z  (virtual  displacement)  of  z  that  is  to 
be  consistent  with  the  constraint  equations  f'z)  *  0, 


<b  6q  =  ♦  V  6z  =  0 
q  q 


(1-0 


where  6q  =*  V  6z  from  Eq.  14.  From  Eq.  10  and  the  fact  that  V  is 
orthogonal,  Eq.  17  becomes 


U  D6z  -  0 


(18) 


Since  U  is  orthogonal,  one  may  premultiply  Eq.  18  by  U  and  use  the  form 
of  D  in  Eq.  11  to  obtain 


[T 
e,6z,  e«6z_  ...  e  6z  1  -0 

lit/  m  mJ 


(19) 


This  shows  that  Sz^  +  •  •«,  6z  can  not  be  computed  from  Eq.  18. 

Hence,  fiz^  ^  ...,  5z  can  be  evaluated  only  from  the  differential 

equations  or  motion.  Therefore,  zffl  +  j,  ...,  zfl  are  selected  to  be  the 
Independent  generalized  coordinates  for  solution  of  the  equations  of 
motion  and  Zj ,  ...,  zm  are,  be  default,  the  dependent  generalized 
coordinates  that  must  be  computed  from  the  constraint  equations.  Since 
the  kinematic  constraint  equations  are  nonlinear,  an  iterative  technique 
such  as  the  Newton-Raphson  method  is  required  to  solve  for  the  dependent 
generalized  coordinates. 
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An  important  point  in  using  generalized  coordinate  partitioning  for 
solution  of  DAE  is  to  obtain  a  criterion  that  determines  if  a  set  of 
independent  generalized  coordinates  is  acceptable  from  a  computational 
point  of  view.  Each  redefinition  of  independent  generalized  coordinates 
requires  restarting  the  predictor-corrector  method  [18]  used  for  their 
integration,  which  requires  substantial  computational  effort.  On  the 
other  hand,  not  using  the  appropriate  choice  of  independent  generalized 
coordinates  introduces  avoidable  computational  errors. 


6.  PROPERTIES  OF  GENERALIZED  COORDINATE  fARTITIONING  RISING 

SINGULAR  VALUE  DECOMPOSITION.  Considering  only  constraints  for 
which  =  0,  Eq.  6  can  be  written  as 


$  = 


4>  q 

q 


(20) 


Premultiplying  Eq.  20  by  U  and  usi ig  the  definition  of  z  in  Eq.  15, 


Dz  =  0 


Due  to  the  special  form  of  matrix  D  (Eq.  11),  this  is 


r  •  •  •  iT  _ 

KZ1  e2z2  -*  Vm)  "  ° 


(21) 


Since  the  e’s  are  not  zero  for  a  Jacobian  matrix  with  full  row  rank,  Eq. 
21  implies 


'1  2 


. . .  z  ]  =0 

mJ 


(22) 


Since  orthogonal  transformations  preserve  norm. 


(23) 


and  since  z 


1* 


z  are  equal  to  zero,  Eq.  23  is 


n 

r 

L 


i=mrl 


=: 


(24) 


If  one  defines  Unit  Mass  Kinetic  Energy  of  a  system  as  the  kinetic 
energy  when  the  system  has  unit  masses  and  unit  moments  of  inertia,  Eq. 
24  indicates  that  the  unit  mass  kinetic  energy  due  to  independent 
composite  coordinates  captures  the  entire  unit  mass  kinetic  energy  of 
the  system.  Thus,  one  can  locally  generate  all  motion  in  the  system 
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with  a  smaller  number  of  coordinates  (equal  in  number  to  the  number  of 
degrees  of  freedom).  This  property  can  be  used  to  develop  a  criterion 
that  determines  when  to  redefine  independent  composite  coordinates.  As 
time  progresses,  after  a  new  set  of  independent  composite  coordinates 
has  been  selected,  the  dependent  composite  coordinates  acquire  small 
non-zero  values,  resulting  in  a  reduction  of  unit  mass  kinetic  energy 
captured  by  the  independent  composite  coordinates.  Therefore,  if  the 
unit  mass  kinetic  energy  captured  by  the  independent  composite 
coordinates  is  less  than  a  predetermined  -fraction  of  the  total  unit  mass 
kinetic  energy  of  the  system,  a  new  set  of  independent  composite 
coordinates  is  defined. 

from  Eqs.  14  and  15,  the  vector  of  virtual  displacements  6q  and  the 
vector  of  velocities  q  can  be  expressed  in  terms  of  the  rows  of  V; 

n  T 

fiq  =  l  (25) 

j=m+l  3  J 

and 

q  -  l  z.vj  (26) 

j=m*l  3  3 

Thus,  displacement  is  confined  to  a  subspace  of  Rn  that  is  spanned 

T  T  T  T 

by  V  ,  ...,  V  and  velocities  along  V, ,  ...,  V  axes  are  zero. 
tfH-i  n  i  m 

Therefore,  to  first  order,  the  independent  composite  coordinates  are 

Lagrangian  coordinates  that  locally  generate  all  system  information. 

Matrix  V  of  Eq.  10  may  be  partitioned  into  submatrices  VI  and  VD, 
representing  the  independent  and  dependent  portions  of  matrix  V, 

V  ■  [™r]  n  -  .  <27> 

It  can  be  shown  that  rows  of  matrix  VI  are  orthogonal  to  rows  of  the 
Jacobian  matrix  [19].  Therefore,  integrating  for  independent  composite 
coordinates  moves  the  system  along  a  tangent  hyperplane  of  the 
constraint  surface. 

Consider,  the  slider  crank  mechanism  shown  in  Fig.  2(a),  initially 
in  position  ABC,  Let  <t>j ,  the  angular  coordinate  of  body  1,  be  selects.* 
s  the  independent  generalized  coordinate  for  integration  of  the 
equations  of  motion.  Prediction  of  <f>^  moves  body  1  to  pc  sition 
A’B'  (Fig.  2(a)},  breaking  re 'olute  joints  at  A’  and  B*.  Iterative 
solution  of  the  dependent  generalized  coordinates  from  constraint 
equations  moves,  the  system  to  position  AB"C" >  which  involves 
considerable  movement  of  all  bodies.  Duless  an  efficient  and  accurate 
prediction  of  initial  estimates  for  dependent  generalized  coordinates  is 
made  prior  to  iterative  solution,  a  large  number  of  iterations  is 
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required  and,  under  some  conditions,  the  iterative  method  may  rail  to 
converge . 

During  integration  for  independent  composite  coordinates,  all 
phyoical  coordinates  are  advanced  very  close  to  their  true  values 
(Fi'  2(b)),  along  a  tangent  hyperplare  to  the  constraint  surface. 
The^!ore,  the  estimate  of  dependenc  composite  coordinates  is  accurate 
and  either  none  or  just  one  Newton  Raphson  iteration  Is  required  for 
convergence . 

To  compute  physical  coordinates  and  velocities  after  independent 
composite  positions  and  velocities  have  been  computed,  matrix  equations 
of  the  form 


(28) 


need  to  be  solved  [1,19].  It  can  be  shown  that  the  c  efficient  matrix 
of 

Eq.  28  can  not  be  ill-conditioned  if  the  Jacobian  matrix  is  not  ill- 
conditioned  [19]. 


It  has  been  observed,  during  simulation  of  several  mechanical 
systems,  that  c;  tposite  generalized  coordinates  remain  an  acceptable 
choice  of  independent  coordinates  for  a  longer  period  of  simulation  time 
than  do  physical  generalized  coordinates  based  on  the  LU  partitioning. 


7.  ALGORITHM  FOR  SOLUTION  OF  EQUATIONS  OF  MOTION.  In  this 
section,  an  algorithm  to  solve  *-he  equations  of  motion  of  dynamic 
mechanical  j/3tems  using  the  method  developed  in  Sec.  5  is  presented. 
Generalized  coordinates  are  partitioned  into  independent  and  dependent 
parts,  using  the  SVD  method  of  Sec.  5.  Subroutine  DE,  which  is  based  ou 
an  Adams-Bashforth  predictor-corrector  [18]  method,  is  employed  to 
integrate  for  the  independent  composite  coordinates.  Newton  Raphson 
iteration  is  used  to  compute  physical  coordinates  from  the  independent 
composite  coordinates  at  every  cime  step.  This  information  is  used  to 
compute  physical  velocities  from  the  independent  composite  velocities 
and  finally  to  solve  for  the  vector  of  physical  accelerations.  The 
algorithm  is  as  follows: 

Let  i.  initially  0,  be  an  indicator  of  the  current  time  step;  i.e., 
i  =  0  implies  t  *  tQ. 

(1)  Read  initial  position,  velocity,  and  other  system  data. 
Construct  the  Jacobian  $  .  Consider  the  n-m  user  supplied  positions  v 
to  be  accurate  and  correct  the  position  vector  q  j^using  Newton  Raphson 
iteration  froj  Eqs.  29  and  27 <.  Velocity  vector  ^  is  computed  from  Eq. 
31,  considering  the  n-n  user  supplied  velocities  v  to  be  accurate. 
Boolean  matrix  B  fixes  the  user  specified  positions  and  velocities  to 
prescribed  values, 
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Here,  k  is  an  iteration  counter.  Equation  29  computes  successive 
corrections  to  q,,  .  until  all  kinematic  constraints  are  satisfied,  to 
required  accuracy. 

(2)  Factor  using  singular  value  decomposition  of  Sec.  4,  as 


4>  «  U  DV 

q 


Partition  V  into  Independent  and  dependent  portions  (See  Sec.  5), 


■•[-a-] 


(3)  Calculate  q  and  A  from  the  acceleration  equation  (Eq.  8, 
presented  here  for  reference). 


L  %  °J  LSJ  HvV  •  s«  -* t*qe H J 

j  (4)  Calculate  independent  composite  position  *1*,  'elocity 
zl  ,  and  acceleration  zl  (using  the  definition  of  zl  from  Eq.  14)  as 


[zl1  Zl1  Zl1]  *  [Vljfq1  qx  q1] 

(5^+1Inte|^te  [zl,  r;l],  with  [zi1,  zl1]  as  initial  conditions,  to 
get  [zl  ,  zl  ]  at  t^+1  =  ^  4  At,  using  Adaas-Bashforth  predictor- 
corrector  integration. 

(6)  Predict  qi+1  and  correct  it  iteratively  by  Newton  Raphe on 
iteration,  using  Eqs.  35  and  36,  until  constraints  are  satisfied  to 
required  accuracy. 


•  i+1 

(7)  Compute  q  from  the  velocity  equation  (Eq.  6  presented  here 
for  reference), 

*q  .i+1  I  0 

(8)  Compute  acceleration  q  and  Lagrange  multiplier  X  from  Eq.  34. 

(9)  Set  i  =  i  +  1  and  t^.j  =  t_^  +  At.  If  ||zlj|  is  less  than  a 
predetermined  fraction  of  ||q{|„  (See  Eq.  24),  matrix  VI  needs  to  be 
updated.  If  so,  repeat  steps  2-9.  If  VI  is  still  acceptable,  repeat 
steps  3-9. 

8 .  NUMERICAL  EXAMPLE .  To  illustrate  the  method  developed, 
dynamics  of  an  Ml 13  tracked  personnel  carrier  going  over  a  bump  is 
simulated.  The  vehicle  is  modeled  by  11  bodies,  as  shown  In  Fig.  3. 

The  global  coordinate  system  is  located  at  the  front  of  the  vehicle,  as 
shown  in  Fig.  4.  The  local  coordinates  of  the  wheels  and  the  chassis 
are  located  at  their  centers  of  mass  and  are  initially  oriented  parallel 
to  the  global  X-axis.  The  road  arms,  however,  are  assumed  to  have  their 
centers  of  mass  at  the  points  where  they  art  attached  to  the  chassis  by 
revolute  joints.  The  £-axes  of  their  local  oordinates  are  parallel  to 
the  axes  of  the  road  arms  (Fig.  4). 

Body  1  is  the  chassis,  with  weight  22,449  lb  and  pitch  moment  of 
inertia  133,000  lb-in. -s  .  Bodies  2-6  are  the  wheels,  each  with  weight 
180  lb  and  moment  of  inertia  90  lb-ft  .  The  wheels  are  attached  to  the 
chassis  by  road  arms  (bodies  7-11).  Bodies  7,  8,  and  11  each  have 
weight  44  lb  and  moment  of  inertia  15.86  lb-ft  .  Bodies  9  and  10  each 
have  weight  22  lb  and  moment  of  inertia  7.93  lb-ft  . 

Torsional  springs,  each  with  a  spring  rate  of  70,000  in.-lb/rad., 
are  attached  between  the  chassis  and  road  arms.  In  addition,  a  bump 
stop  prevents  contact  between  the  leading  road  arm  (body  7)  and  the 
chassis.  The  bump  stop  is  activated  if  body  7  moves  by  an  angle  greater 
than  0.2090  radians,  counterclockwise.  The  stiffness  of  the  spring  that 
models  the  bump  stop  is  5.76x10®  lb/ in.  Fluid  dampers  are  connected 
between  the  chassis  and  bodies  7,  8,  and  11,  with  damping 
characteristics  shown  in  Fig.  5.  To  prevent  adjacent  wheels  from 
penetrating  one  another,  logical  springs,  each  with  stiffness  10,000 
lb/ in.  in  compression  and  zero  in  tension,  are  modeled  between  them. 

The  track  is  modeled  as  a  series  of  spring-dampers,  with  spring  constant 
10'*  lb/in.  and  damping  coefficient  41.7  lb-sec/in.  A  pretension  of 
2,250  lb  is  applied  to  the  track. 
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The  vehicle  is  initially  at  rest  in  the  configuration  given  by 
Table  l. 


Table  i.  Initial  Configuration  of  Mi  13 


Body  Ho. 

X 

Position 

y 

♦ 

1 

6.424830 

3.594392 

0.0098366 

2 

2.207953 

1.142995 

-0.0002644 

3 

4.380040 

i.137461 

-0.0004955 

4 

6.555254 

1.138714 

-0.0007750 

5 

8.729923 

1.140114 

-0.0010999 

6  ‘ 

10.90677 

1.145697 

-0.0015061 

7 

1.293782 

? .642426 

-0.5000145 

8 

3.481177 

1.663943 

-0.5298527 

9 

5.668571 

1.685460 

-0.5525503 

10 

7.855965 

1.706978 

-0.5754019 

il 

10.04336 

1.728495 

-0.5937436 

To  simulate  the  vehicle  moving  over  a  semi-circular  obstacle  of  8 
in.  radius,  the  x  coordinate  of  the  chassis  is  held  fixed  and  the 
terrain  is  moved  under  it  at  a  speed  of  20  ft/sec.  The  VI  matrix  at  the 
Initial  time  is  given  in  Table  2. 

The  simulation  was  carried  out  for  a  period  of  2  seconds.  The 
position,  velocity,  and  acceleration  of  the  vertical  coordinate  of  the 
chassis  are  plotted  versus  time  in  Figs.  6-8. 
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For  Chis  problem,  solution  using  the  SVD  partitioning  method  did 
not  require  redefinition  of  the  independent  coordinate  for  the  entire 
simulation  period.  Furthermore,  no  Newton -Raphson  iterations  were 
required  to  compute  physical  coordinates  from  the  Independent  composite 
coordinates.  When  LU  partitioning  was  used  to  solve  this  problem, 
independent  generalized  coordinates  were  redefined  several  times. 

During  integration  for  independent  generalized  coordinates,  using 
subroutine  DE  [18],  the  tightest  error  tolerance  that  can  be  imposed  for 
that  set  of  equations  is  determined.  If  the  user  specified  tolerance  is 
less  than  this  value,  subroutii.  increases  the  error  tolerance.  For 
Integration  of  the  Independent  com*.,  oite  coordinates,  subroutine  DE 
increased  the  error  tolerance  to  0.0032.  The  maximum  allowed  error 
tolerance  computed  by  subroutine  DE  for  integrating  the  independent 
physical  coordinates  was  0.0128,  four  times  the  tolerance  for  solution 
by  the  SVD  method.  This  suggests  that,  for  this  problem,  the 
independent  composite  coordinates  are  better  than  independent  physical 
coordinates,  from  an  integration  point  of  view. 
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APPLICATION  OF  SCREW  CALCULUS  TO  THE  EVALUATION  OF 
MANIPULATOR  WORKSPACE 
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ABSTRACT.  In  this  paper,  an  analytical  techique  is  presented  that  is  based 
on  screw  calculus  and  dual -number  matrices  to  derive  the  kinematic  equations  and 
the  workspace  formulations  of  robotic  manipulators.  A  computational  procedure 
for  the  quantitative  evaluation  of  workspace  volume  is  also  developed.  Several 
examples  are  chosen  to  demonstrate  the  usefulness  and  the  effectiveness  of  the 
approach. 

1.  INTRODUCTION.  One  of  the  basic  problems  encountered  In  manipulator 
design  is  the  determination  of  the  shape  of  workspace  and  its  characteristics. 

The  workspace,  which  is  the  zone  of  operation  of  a  manipulator,  is  the  space 
associated  with  possible  positions  and  orientations  of  the  last  link  of  a 
manipulator.  A  knowledge  on  the  workspace  of  a  manipulator  can  provide  a 
measure  of  the  efficiency  of  the  design.  Therefore,  the  investigation  on 
workspace  is  of  fundamental  interest. 

There  have  been  few  investigations  of  the  subject  on  the  record.  A  summary 
is  given  in  references  [1,2].  Several  approaches  have  been  used  dealing  with 
computational  development  of  workspaces,  such  as  iterative  (Kumar  and  Waldron 
31,  Sugimoto  and  Duffy  [4];  Tsai  and  Soni  [5]);  grid-scanning  (Lee  and  Yang 
lj);  and  lately,  a  non-iterative  generation  scheme  (Hansen,  Gupta  and  Kazerounian 
2])  and  tl.a  technique  based  on  Gauss'  divergence  theorem  (Jou  and  Waldron  [8]). 
lost  investigators  involve  the  use  of  conventional  [4x4]  matrices  method  of 
Hartenberg  and  Denavit  [b]  for  the  analytical  representation  and  generation  of 
workspace.  Two  problems  are  of  major  concern:  one  is  computational  efficiency; 
the  other  is  the  limitation  of  most  conventional  methods  which  are  applicable 
only  to  manipulators  with  revolute  joints.  Manipulators  with  prismatic  joints, 
which  ere  common,  and  other  special  kinematic  pairs  such  as  cylindrical  and 
screw  pairs  are  very  difficult  to  evaluate  as  far  as  the  workspace  is  concerned. 

In  most  of  these  cases,  methods  are  either  not  available,  or  they  are  computationally 
too  inefficient  tn  be  useful. 

An  analytical  technique  is  presented  in  this  investigation.  It  is  based  on 
screw  calculus  and  dual-number  matrices,  to  derive  the  kinematic  equations  and 
the  workspace  formulations  of  robotic  manipulators.  The  application  of  the 
method  to  the  study  of  the  kinematics  and  dynamics  of  manipulation  is  relatively 


1  Present  address:  Department  of  Mechanical  Engineering,  State  University  of  New 
Yurk  at  Stony  Brook,  Stony  Brook,  New  York  11794 


765 


PREVIOUS  PAGfi. 
IS  BLANK 


wv  wrw  wv  *  tkecti’ v 


;  KT1  <r.-\  M  * 


n*  ?  ivi’w'tvjv  L"v  r« 


'U  wv  - 


new  [6,7].  However,  its  application  to  the  representation  of  manipulator 
workspace  has  not  been  explored. 

The  method  of  screw  calculus  offers  many  advantages,  especially  in  dealing 
with  three-dimensional  kinematics.  In  these  cases  the  effort  to  derive  closed 
form  analytical  expressions  is  often  Impaired  by  laborious  or  even  insurmountable 
algebraic  manipulations.  The  use  of  compact,  screw  notations  and  dual  formulation 
facilitate  substantially  the  effort  on  algebraic  manipulations.  Consequently, 
concise  expressions  can  be  formulated,  which  would  provide  valuable  geometric 
insight  Into  the  rigid  body  under  constrained  spatial  motion.  In  addition, 
there  is  significant  computational  advantage.  Dual  ntonber  operations  not  only 
facilitate  analytic  work,  but  also  make  computer  programming  simpler  and  more 
efficient  via  dual  subroutines,  namely.  Incorporating  algebraic  operations  of 
dual  numbers  and  trigonometric  functions  of  dual  angles  into  computer  subroutines. 

By  declaring  all  the  motion  variables  to  be  complex,  *e  are  able  to  perform 
algebraic  manipulations  on  the  rotational  part  (i.e..  real  part)  and  translational 
part  (i.e.,  imaginary  part)  of  each  motion  variable  at  the  same  time  in  a  dual 
equation. 

In  this  paper,  a  new  workspace  generation  technique,  using  the  optimum  path 
search,  is  proposed.  The  outlining  of  the  p» ejection  of  workspace  on  a  specified 
plane  involves  the  determination  of  a  minimum  distance  between  the  end-effector 
of  a  manipulator  and  a  specified  target  point  on  the  plane  and  the  search  is 
converted  into  an  optimization  problem.  An  optimization  technique  Is  used, 
which  Is  the  FMFP  code  of  Fletcher  and  Powell  [9].  The  algorithm  thus  developed 
is  a  partial -scanning  method  to  generate  the  workspace.  It  is  shown  that  the 
algorithm  ensures  a  significant  reduction  of  scanning  points  and  provides 
improved  computational  efficiency  In  especially  complicated  cases  as  compared  to 
the  conventional  scanninq  technique  [1].  the  algorithm  developed  here  Is 
applicable  to  manipulators  having  not  only  the  revolute  joints  but  also  the 
prismatic  and  cylindrical  joints.  With  slight  extension,  it  can  Include  also 
the  screw  joints.  An  algorithm  for  non-revolute  type  of  joints  for  the  quantitative 
evaluation  of  workspace  is  seldom  seen,  at  least  to  the  author’s  knowledge. 

In  the  following,  we  first  begin  with  a  description  of  the  underlining 
principles  of  the  new  technique  and  how  the  algorithm  works.  Then  a  comparative 
study  Is  performed  with  a  previously  established  technique  [X]  on  how  Is 
computation  time  affected  with  accuracy.  Finally,  the  algorithm  Is  applied  to 
evaluate  several  workspace  boundary  profiles  and  the  results,  whenever  possible, 
are  compared  with  previously  published  date.  [1,10]. 

2.  MANIPULATOR  DESCRIPTION..  A  manipulator  with  n  joints  In  series  can  be 
represented  schematically  as  shown  in  Tig.  1.  The  motion  of  joint  n  consists  of 
a  rotation  of  en  about  axis  Zn,  a  translation  along  Zn,  a  translation  along  the 
current  Xn  axis  of  an,  and  finally  a  lotatlon  of  an  about  the  current  Xn  axis. 

The  purposes  of  en  an  1  b,,  are  a  function  of  the  joint  type.  If  the  joint 
Is  a  revolute,  then  bn  Is  a  constant  and  en  is  variable.  If  the  joint  Is 
prismatic,  then  on  Is  constant  and  bn  Is  variable  (Fig.  2). 

The  transformation  that  performs  the  task  of  moving  one  joint  to  the  next 
is  represented  by  the  product  of  homogeneous  transformation  matrices,  such  as  in 
Paul  [11] 


where 


A„  =  Rot(Z,9_)  Trans(0,0,bn)  Transla-.O.O)  Rot(X,aJ 

n  n  n  n  n 


(i) 


31 

R 

M 
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Rot(2,en)  -  Rotate  about  2-ax Is  en 
Trans(0,0,bn)  -  Translate  bn  In  th  •  Z  direction 
Trans(aR,0,0)  -  Translate  an  In  the  X  direction 


Rot(X,on)  -  Rotate  about  X-axis  an 


Conventionally,  the  [4x4]  matrix  form  of  the  Denavit  and  Hartenberg  [12] 
An  can  be  expressed  as 

cos  e  -sin  e  cos  o 

sin  9  sin  « 

a  cos  e 

sin  e  cos  e  cos  a 

-cos  e  sin  a 

a  sin  e 

(2) 

0  sin  a 

COS  a 

b 

0  0 

0 

1  J 

In  this  investigation,  screw  calculus  and  dual-number  matrices  [13,14]  are 
used  to  derive  the  kinematic  equations  and  workspace  formulations  of 
manipulators.  There  are  several  advantages  ror  this  approach.  The  compact 
screw  notations  and  dual  formulation  facilitate  substantially  the  effort  on 
algebraic  manipulations  and  provide  not  only  concise  expressions  but  also 
significant  computational  advantage.  Using  dual-number  operations,  we  have 


An  =  R0T(Z,in)  R0T(X,in)  (3) 

where 

An  -  Oual  Transformation  Matrix 
8n  -  Oual  Variable  Angle 
«n  -  Oual  Constant  Angle 

Oual  angles  describe  both  motions  of  rotation  and  translation.  9n  and  <in 
relate  to  the  previously  defined  variables  as  follows: 


iii'ii- 


(4) 


3n  =  fe„,b_)  or  e„  +  e  b_ 
n  n  fr  n  n 


£L  =  (a„»aj  or  r  +  e  a,, 
n  '  n  n*  n  n 


The  transformation  can  now  be  rewritten  In  [3x3]  dual  matrix  notation  as 
follows: 


COS  6 

-sin  e  cos  a 

sin  e  sin  o 

- 

sin  § 

cos  9  cos  u 

-cos  e  sc.  _ 

0 

sin  o 

COS  a 

The  dual  multiplication  and  dual  trigonometry  functions  are  defined  by. 

Multiplication:  a  b  *  a  b  +  e(aQb+bQa)  (6) 

Dual  sine:  sln(a)  *  $1n(a)  +  e  a©  cos(a)  (7) 

Dual  cosine:  cos(a)  =  cos(a)  -  e  ao  sin(a)  (8) 

where,  letters  with  angle  symbols  on  top  signify  dual  numbers 

a  =  (a,ao)  or  a  +  e  ao 
b  s  (b,bg)  or  b  +  e  bo 

e  *  The  dual  unit  having  the  property  t2  *  0 
a  s  Real  part  of  a 
ao  =  Dual  part  of  a 
b  =  Real  part  of  b 
bo  =  Dual  part  of  b 

Once  the  transformation  for  one  joint  has  been  defined,  the  transformation  which 
represents  the  relation  from  the  last  joint  to  the  ground  of  a  6- joint  manipulator 
Is  given  as: 

T  -  Ai  Ag  Ag  A^  Ag  Ag  (9) 

An  additional  transformation,  £,  is  used  to  represent  the  relation  from  the  last 
joint  to  the  hand.  This  transformation  Is  a  constant. 


hand  coordinate  In  Eq.  (10). 


(Ground  Coordinates)  =  TE(Hand  Coordinates)  (10) 

If  the  [4x41  method  Is  chosen,  the  location  of  the  hand,  given  the  value  of 
joint  variables,  can  be  determined  by  entering  the  vector  (0,0,0,1)T  of  the 


rvV  **  *  *  , 


un -s*c^«  r :jutcib  x*j’Kcrs.icy.v=x,x=«^=^J^ii^^'»t>:2^^  -szwx.'-iU*  u*  cv.i^Kd7Tii^rr^ccj 


If  the  dual  number  method  is  used,  a  technique  entitled  "The  Transference 
Principle  [15]"  must  be  applied  to  the  transformation  TE  to  obtain  this  information. 

Computer  programing  involving  dual  numbers  is  efficient  via  dual  subroutines, 
namely,  incorporating  algebraic  operations  of  dual  numbers  and  trigonometric 
functions  of  dual  angles  into  computer  subroutines.  By  declaring  all  the  motion 
variables  to  be  complex,  we  are  able  to  perform  algebraic  manipulations  on  the 
rotational  part  (i.e.,  the  real  part)  and  the  translational  part  (i.e.,  the 
imaginary  part)  of  each  motion  variable  at  the  same  time  in  a  dual  equation. 

3.  BASIC  APPP.OACH.  The  basic  approach  involves  the  search  for  an  optimum 
path  between  the  end-effector  of  a  manipulator  and  a  target  position  on  a 
specified  plane  in  which  the  workspace  projection  is  desired.  Subsequently, 
this  minimum  distance,  which  Is  obtained  through  an  optimization  technique,  is 
then  compared  with  the  actual  position  of  the  target  to  determine  the  accessibility 
of  the  manipulator.  The  objective  is  to  provide  an  intelligent  partial  scanning, 
with  significant  reduction  of  scanning  points,  to  gain  computational  efficiency. 

The  method  of  screw  calculus  and  [3x3]  dual-number  matrix  [13, 141  is  the 
basis  for  the  analytical  formulation  of  this  investigation.  This  allots  one  to 
take  full  advantage  of  the  compactness  and  clarity  of  dual  formulation  in 
describing  the  kinematic  characteristics  of  the  motion  of  manipulators. 

In  the  following,  the  basic  concept  on  the  generation  of  workspace  boundary 
and  volume  is  discussed,  then  the  optimum  path  search  technique  and  the  optimization 
problem  are  formulated. 

3 . 1  On  the  Generation  of  Boundary  Profile  and  Volume 

The  Boundary  Profile  Is  defined  as  the  Image  of  the  workspace  produced 
by  fixing  the  first  joint  and  passing  this  workspace  through  a  plane  that  Is 
perpendicular  to  the  motion  produced  by  the  first  joint. 

For  a  manipulator  with  first  joint  revolute,  the  boundary  profile  would  be 
produced  by  taking  all  the  points  in  the  workspace  and  mapping  them  onto  the  Y-Z 
plane  as  follows: 


3-D  points  in  workspace 
( X, Y ,Z )  ==> 


2-D  Y-Z  plane 

I  =  7x2  +  Y 2 

J  =  Z 

(UO) 


(ID 


This  Is  a  circular  projection  on  the  Y-Z  ne  about  l\  [1]. 

For  a  manipulator  with  first  joint  prismatic  the  mapping  procedure  is, 
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V 

Is  J. 


SF* 


(X.Y.Z)  ==>  I  *  X 

J  =  Y  (12) 

(1,0) 

This  is  a  parallel  projection  on  the  X-Y  plane. 

Once  the  area  of  this  projection  is  known,  an  approximation  of  the  workspace 
volume  can  be  established. 

If  the  first  joint  is  a  revolute,  the  volume  Is  approximated  by  integrating 
the  area  as  a  volume  of  revolution  ahout  1\  over  the  limits  of  the  first  joint 
motion.  If  the  first  joint  is  prismatic,  then  the  volume  is  simply  the  area 
times  the  total  joint  excursion. 

This  appm-4mation  may  result  in  error  due  to  the  undetermined  end  shape. 
Fortunately,  n»”  c  -anipulators  usually  have  the  first  two  or  three  links 
coplanat  and  the-  Jinks  determine  the  majority  of  the  workspace.  Error  is  then 
due  to  the  last  sorter  links  resulting  in  small  error. 

3.2  The  Optimum  Path  Search  Technique 

As  it  was  mentioned  earlier,  the  present  method  for  workspace  generation 
follows  the  grid-scanning  approach  of  Lee  and  Yang  [1].  However,  there  is  a 
basic  difference  in  performing  the  scanning.  The  new  method  uses  en  optimization 
technique  which  facilitates  the  scanning  process  and  reduces  the  number  of 
scanning  points.  Consequently,  It  Is  a  partial -scanning  technique.  In  the 
following  the  new  method  is  compared  with  the  full -scanning  technique  of  Lee  and 
Yang  [1]  and  the  basic  formulations  of  the  Optimum  Path  Search  Technique  are 
presented. 

( 1 Scanning  Technique  of  Lee  and  Yang  f  1 1 

This  technique  which  Is  explained  In  the  following  consists  of 
scanning  through  all  the  joint  variables  by  means  of  nested  loops,  using 
appropriately  small  step  sizes  to  ensure  that  the  points  generated  are  dense 
enough  to  fill  all  points  inside  the  workspace  projection. 

•  Define  an  array  that  represents  a  plane  passing  through 
the  workspace. 

•  Scan  through  ail  combinations  of  joint  variables  and  map 
the  location  of  the  hand  onto  the  array. 

•  Find  the  border  of  the  workspace  mapped  on  the  array  and 
trace  it.  This  Is  for  graphic  display. 

•  Perform  volume  calculation. 


One  way  to  estimate  the  step  sizes  for  each  joint  Is  to  configure  the 
manipulator  in  its  "longest"  position  from  the  joint  of  Interest.  The  step  size 
is  determined  by  the  largest  joint  displacement  that  will  result  in  the  motion 
of  the  hand  to  move  from  one  array  entry  to  the  next.  This  calculation  is 
repeated  for  each  joint.  Often  Is  the  case  that  less  than  all  the  joint 
variables  will  play  a  role  in  defining  the  primary  workspace.  An  examole  of 
this  would  be  a  manipulator,  such  as  a  Unlmation  PUMA,  witnout  an  end- effector 
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specified  and  also  having  a  3-degrees-of-freedom  wrist.  This  manipulator 
requires  only  two  joint  variables  to  be  scanned  through.  Remember  that  the 
first  joint  is  taken  into  account  by  the  mapping  procedure  mentioned  in  Section 


It  is  worth  noting  that  when  this  technique  was  published  it  was  exclusively 
for  manipulators  with  revolute  joints.  The  technique  works  quite  well  with 
prismatic  joints  by  making  the  aDproprlate  changes  in  the  transformation 
matrices. 

(2)  The  Optimum  Path  Search  Technique 

The  new  method  differs  from  the  preceding  method,  primarily  because 
the  workspace  generation  technique  is  based  on  an  optimum  path  search  as 
explained  in  the  following: 

•  Also  define  an  array  that  presents  a  plane  passing  through 
the  workspace 

•  Rather  than  scanning  through  the  joint  angles  and  projecting 
the  position  on  the  array,  specify  a  location  on  the  array  and 
perform  inverse  calculations  to  obtain  joint,  angles  by  solving 
the  following  optimization  problem: 

With  the  two  techniques  outlined  In  some  details  a  comparison  can  now  be 
made  and  is  summarized  in  Table  i. 

The  Optimum  Path  Search  Technique  uses  a  recursive  method  to  determine  the 
minimum  of  the  objective  function.  This  can  require  a  great  deal  of  computer 
time,  especially  since  points  outside  of  the  workspace  are  rechecked  a  number  of 
times  (3-5)  before  being  identified  as  such.  However,  with  this  rather  substantial 
penalty  of  large  computer  time,  comes  the  advantage  of  having  to  only  locate 
points  around  the  border  of  tne  workspace.  This  results  in  computational  time 
to  scale  with  the  accuracy  In  terms  of  a  length  scale.  An  example  of  this  would 
be  If  two  runs  were  made,  one  with  a  40x40  array  and  a  second  with  a  80x80 
array,  the  80x80  will  take  approximately  twice  as  long  as  the  40x40.  This  is 
because  the  perimeter  of  the  workspace  has  doubled  in  terms  of  array  entries. 

If  analysis  is  to  be  performed  with  the  joint  variable  scanning  process, 
then  doubling  the  grid  size  results  in  a  multiplication  of  the  computational 
time.  This  factor  is  2  raised  to  the  number  of  joints  active  in  determining  the 
workspace.  This  is  due  to  reduction  of  step  size.  If  two  joint  variables  are 
scanned  then  the  computation  time  would  Increase  by  4.  For  a  very  general 
manipulator  having  five  joints  active  in  defining  the  workspace,  the  calculation 
time  would  be  increased  by  a  factor  of  32. 

For  analysis  of  the  workspace,  the  technique  of  choice  will  depend  on 
accuracy  and  the  number  of  active  variables  present.  With  increased  accuracy 
(i.e.,  larger  array  size),  there  will  be  a  point  at  which  the  optimization 
technique  will  become  advantageous. 

At  present  the  best  way  to  establish  what  technique  Is  to  be  used  is  to  run 
a  particular  manipulator  with  both  techniques  using  a  small  array  such  as  10x10. 

Note  the  time  for  each  run  and  use  the  calculation  scaling  1aw3  mentioned  to 


determine  which  technique  will  take  the  least  time  for  a  des'red  accuracy.  By 
using  the  trail  run,  effects  of  the  number  of  active  variables  on  the  optimization 
process  are  also  incorporated. 

The  Optimum  Path  Search  Technique  does  have  advantages  that  the  Grid-Scanning 
process  does  not.  For  Instance,  a  most  significant  one  is  that  the  former 
technique  lends  itself  to  secondary  workspace  analysis.  If  the  workspace  with  a 
specific  end-effector  orientation  is  required  over  a  plane  or  surface,  simply 
introduce  a  constraint  pertaining  to  orientation.  This  property  cannot  be 
achieved  with  the  Grid-Scanning  process.- 

4.  THE  WORKSPACE  GENERATION  TECHNIQUE  BASED  ON  OPTIMUM  PATH  SEARCH.  The 
basic  approach,  as  it  was  outlined  In  the  previous  section,  involves  the 
solution  of  the  following  optimization  problems. 

The  Optimization  Problem 

Minimize:  Distance  of  the  hand  to  the  desired  location 

Subject  to:  All  physical  constraints,  such  as  joints  within 
limits  of  rotations  and  translations,  etc. 

If  the  distance  falls  within  the  array  entry,  the  point  is  inside  the 
workspace.  Otherwise,  it  is  outside  of  the  workspace. 

To  perform  the  minimization  of  the  distance  function  subject  to  joint 
constraints,  an  objective  function  was  developed  using  the  exterior  method  to  be 
applied  to  an  unconstrained  optimization  algorithm,  the  conjugate  direction 
minimization  technique  of  Fletcher  and  Powell  or  the  FMFP  code  [S].  This  method 
Introduces  penalty  functions  which  Increase  the  value  of  the  objective  function 
if  a  constraint  is  violated. 

Formulation  using  the  exterior  method  is  as  follows  [16], 

Minimize:  f(U)  *  (Tx-«x)2  +  <VH>)2  *  lYHz>2 

Subject  to:  (13) 

Limits  of  Joints:  $^(U)  S  O'  with  i  =  1  to  j 

where  f(U)  denotes  the  objective  function  for  the  constrained  optimization 
problem,  0  denotes  the  argument  vector  or  design,  vector,  T(Tx,fy,Tz)  and 
R(HX!>Hy*Hz)  denotes  the  target  vector  and  the  position  vector  ot  the  hand  or 
end-effector,  respectively. 

The  unconstrained  optimization  problem  Is  defined  as 


Minimize:  F(U)  =  f(U)  +  C1E(U)  (14) 


where  Cj  is  a  weighting  constant,  £  is  the  constraint  function 
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An  algorithm  written  In  FORTRAN  language  for  generating  the  workspace  of  a 
general  manipulator  having  prismatic,  cylindrical  or  screw  joints  «.s  well  as 
revolute  joints  has  been  developed.  The  principal  and  details  of  the  algorithm 
are  given  in  Ref.  (16]. 

5.  CASE  STUDIES.  Four  problems  which  are  believed  to  be  representative 
are  chosen  to  demonstrate  the  effectiveness  as  well  as  the  capability  of  the 
workspace  generation  algorithm.  The  first  two  prob’ *ns  provide  some  indications 
of  how  well  the  prediction  of  the  algorithm  compares  with  both  the  Grid-Scanning 
Technique  of  Lee  and  Vang  [1]  as  well  as  the  theoretical  result  on  a  special 
case,  the  3R  Manipulators  taken  from  Gupta  and  Roth  [10] .  The  other  two 
problems  deal  with  industrial  robots  having  the  prismatic  as  well  as  the 
revolute  joint.  They  are  used  to  demonstrate  the  capability  of  the  algorithm  to 
handle  manipulators  with  joints  other  than  the  revolutes.  The  quantitative 
workspace  information  of  these  two  Industrial  robots  presented  here  is  believed 
to  appear  first  time  on  public  record. 

£;  ~.p1e  1:  A  3R  Manipulator  £10] .  This  example  is  used  to  verify  the 
accuracy  of  the  algorithm.  The  kinematic  parameters  of  this  manipulator  are 
given  in  Table  2.  A  comparison  Is  given  in  the  following: 


Workspace 

Volume  fin. 3] 

Percent 

Error 

This  investigation 

1255. S 

4.4% 

Analytical  result  [10] 

1202.7 

— 

The  Scanning  Technique 

[1] 

1215.7 

1.1% 

The  bounds  on  array*: 

Y  . 
mm 

^max 

*  2.0,  YmA.  =  18.0,  Z.  = 
ma«  min 

=  2.0 

-2.0  and 

*The  bounu*  on  array  denote  the  physical  scaling  of  the  array  which 
gives  the  mathematical  representation  of  the  plane  in  which  the 
workspace  projection  is  made. 


Figure  3  gives  the  computer  graphics  output  of  the  circular  projection 
this  manipulator. 

Example  2:  A  3R  Manipulator.  This  example  <s  used  to  verify  that  the  void 
handling  features  of  the  algorithm  worked  correctly. 


The  kinematic  parameters  table  is  similar  to  that  of  Example  1,  except 
is  changed  to  aj  =  20°  as  shown  In  Table  2.  Consequently,  a  change  is  resulted 
and  a  void  appears  in  the  workspace,  as  shown  in  Fig;  4. 

The  Bounds  on  array:  Ym1n  =  -2.0,  Y  -•  18.0,  »  -2.0  and 
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Workspace  volume:  2400.96  in  . 


Example  3:  The  Bendlx  AA  160-CNC  Manipulator.  Table  3  gives  the  kinematic 
parameters  of  this  manipulator  which  involves  one  prismatic  joint. 

Bounds  on  array:  Ymi„  =  -70,  Y w  =  70,  =  -70,  and  =  70. 

Workspace  volume:  350036.0  In'*. 

Figure  5  gives  the  graph  of  workspace  cross-section. 

Example  4:  A  GCA  Gantry  Manipulator.  This  manipulator,  given  kinematically 
In  Table  4,  contains  three  prismatic  joints.  There  are  four  variables  which  are 
active  in  defining  the  workspace:  02,  03,  04  and  05.  Because  of  the  large  number 
of  active  variables  Involved,  the  problem  lends  itself  well  to  the  new  technique. 
Since  the  workspace  is  intuitively  easy  to  see,  it  provides  as  another  verification 
of  the  technique. 

Bounds  on  array:  Ymin  =  -200.0,  Y^  =  200.0,  =  -200.0  and 

’W  •  200-° 

Workspace  volume:  1038600.0  in3. 

The  workspace  cross-section  is  shown  in  Figure  6. 

6.  CONCLUSIONS.  An  algorithm  for  the  generation  and  evaluation  of  a 
manipulator  workspace  based  on  optimum  path  search  has  been  presented.  Numerical 
examples  were  given  and  results  displayed  on  a  number  of  industrial  robots 
having  prismatic  as  well  as  revolute  joints.  Perhaps  this  is  the  first  workspace 
algorithm  based  on  the  [3x3]  dual-number  matrix  formulation,  rather  than  the 
conventional  [4x4]  matrix  method  and  applicable  to  manipulators  with  joints 
other  than  the  revolute  type.  Besides  the  advantage  of  computational  efficiency 
dealing  with  especially  complicated  manipulator  geometries,  the  technique  also 
lends  itself  more  effectively  over  the  previous  full -scanning  technique  [1]  for 
some  potential  applications,  such  as  the  determination  of  secondary  workspace 
and  the  control  over  work  area  of  interest  during  specific  applications— for 
instance,  cutting  workspace  in  a  specific  plane  such  as  the  plane  that  a 
conveyor  passes  through  [8];  the  reachable  area  over  a  surface  not  a  plane  such 
as  during  assembly  of  an  object;  and  accessable  areas  in  the  last  two  kinds  of 
problems  with  specific  end-effector  orientation,  for  example,  when  manipulating 


objects  on  a  conveyor  or  installing  parts.  The  Investigation  of  these  subjects 
which  is  of  interest  and  of  practical  concern  represents  a  continuing  work  of 
the  authors. 

7.  ACKNOWLEDGEMENTS.  The  second  author  is  grateful  to  the  U.S.  Army 
Research  Office  for  the  support  of  this  research  through  Contract  DAAG29-81-K-006 
to  Rutgers  University. 

8.  REFERENCES. 

[1]  Lee,  T.W.  and  Yang,  D.C.H.,  "On  the  Evaluation  of  Manipulator  Workspace," 
Journal  of  Mechanisms,  Transmissions  and  Automation  in  Design,  Trans. 

ASME.  Vol.  105,  No.  1,  March  1983,  pp.  70-77. 

[21  Hansen,  J.A.,  Gupta,  K.C.  and  Kazerounian,  S.M.K. ,  "Generation  and 

Evaluation  of  the  Workspace  of  a  Manipulator,"  Int'l  Journal  of  Robotics 
Research,  Vol.  2,  No.  3,  Fall  1983,  pp.  22-31. 

[3j  Kumar,  A.  and  Waldron,  K.J.,  "The  Workspace  o*  a  Mechanical  Manipulator," 
Journal  of  Mechanical  Design,  Trans.  ASME,  Vol.  103,  No.  3,  July  1981. 

[4]  Suglmoto,  K.  and  Duffy,  J.,  "Determination  of  Extreme  Distances  of  a  Robot 
Hand  -  Part  I,"  Journal  of  Mechanical  Design,  Trans.  ASME,  Vol.  103,  No. 

3,  July  1981,  pp.  631-636. 

[5]  Tsai,  Y.C.  and  Soni,  A.H. ,  "Accessible  Region  and  Synthesis  of  Robot 
Arms,"  Journal  of  Mechanical  Oesigri,  Trans.  ASME,  Vol.  193,  No.  4, 

Oct.  1983  ,  pp.  803-811. 

[6]  Pennock,  G.R.  and  Yang,  A.T.,  "Dynamic  Analysis  of  a  Multi -Rigid-Body 
Open-Chain  System,"  Journal  of  Mechanisms,  Transmissions  and  Automation 
in  Design,  Trans.  ASME,  Vol.  105,  No.  1,  March  1983,  pp.  28-34. 

[7]  Featherstone,  R.,  "The  Calculation  of  Robot  Dynamics  Using  Articulated- 
Body  Inertias,"  Int'l.  J.  of  Robotics  Research,  Vol.  2,  No.  1,  1983, 
pp.  13-30. 

[8]  Jou,  T.M.  and  Waldron,  K.J.,  "Geometric  Design  of  Manipulators  Using 
Interactive  Computer  Graphics,"  Proceedings  of  6th  World  Congress  of  the 
International  Federation  of  Theory  of  Machines  and  Mechanisms.  New  Delhi, 
India,  Dec.  15-20,  1983. 

[9]  Fletcher,  R.  and  Powell,  M.J.O.,  "A  Rapidly  Convergent  Descent  Method  for 
Minimization,"  British  Computer  Journal,  Vol.  6,  1963,  pp.  163-168. 

[10]  Gupta,  K.C.  and  Roth,  B.,  "Design  Considerations  for  Manipulator 
Workspace,"  ASME  Journal  of  Mechanical  Design,  Vol.  104,  No.  4,  Oct. 

1982,  pp.  704-711. 

[Ill  Paul,  B.,  Robot  Manipulators;  Mathematics,  Programming  and  Control, 

MIT  Press,  Cambridge,  MA,  1981,  p.  279. 


[12]  Denavit,  J.  and  Hartenberg,  R.S.,  "A  Kinematic  Notion  for  Lower-Pair 
Mechanisms  Based  on  Matrices,"  ASME  Journal  of  Applied  Mechanisms, 

June  1955,  pp.  215-221. 

[13]  Oimentberg,  F.M.,  The  Screw  Calculus  and  Its  Applications  in  Mechanics, 
(Izdat.  "Nauka."  Moscow,  USSR,  1965)  English  translation:  AS680993, 
Clearinghouse  for  Federal  and  Scientific  Technical  Information,  April 
1968. 

[14]  Vang,  A.T.,  “Calculus  of  Screws,"  Basic  Question  of  Qesign  Theory,  North 
Holland  Publishing,  Amsterdam,  1974,  pp.  266-281. 

[15]  Hsia,  l.K.  and  Yang,  A.T.,  "On  the  Principle  of  Transference  In  Three- 
Dimensional  Kinematics,"  ASME  Journal  of  Mechanical  Design,  Yol  103, 

No.  3,  July  1981,  pp.  652-656. 

[16]  Cwiakala,  M. ,  On  the  Kinematics  and  Dynamics  and  Computer-Graphics 
Modeling  of  Mechanisms,  Master  degree  thesis,  College  of  Engineering, 
Rutgers,  The  State  University  of  New  Jersey,  New  Brunswick,  NO,  Oct.  1984. 


LIST  OF  FIGURES 


Figure  1 
Figure  2 

Figure  3 
Figure  4 
Figure  5 

Figure  6 


Table  1 

Table  2 
Table  3 
lable  4 


Geometric  description  of  a  manlpulrtor 

Geometrical  relationships  between  joints  n  and  n-1 

(an_i  =  common  normal,  bn_j  =  axial  distance  and  on-l  =  twist  angle) 

Circular  projection  of  the  workspace  for  Example  1 

Circular  projection  of  the  workspace  for  Example  2 

Circular  projection  of  the  workspace  for  the  Bendix  AA-160  CNC 
Manipulator 

Projection  of  the  workspace  on  the  X-Y  plane  for  the  6CA  Gantry 
Manipulator 


LIST  OF  TABLES 

A  comparison  of  the  Optimum  Path  Search  Technique  of  this 
investigation  with  the  Scanning  Technique  of  Lee  and  Yang  [1]. 

Kinematic  parameters  for  the  3R  Manipulator 

Kinematic  parameters  for  the  Bendix  AA  160-CNC  Manipulator 

Kinematic  parameters  for  the  GCA  Gantry  XR  Manipulator 


vvv' 


>  ■  .  r  j  n,.  n  »  -  -<•  *  i-  _-_  v»jv -«i.'v#  vh.‘*t*-*ct  _»vv»cn  ^rf J5.  i.-3T.-Tr,rrr *-* 


Figure  2 


some  trice!  relationships  between  Joints  n  and  n-Uju-l  *  camon 
>rmal,  bn-i  *  axial  distance  and  on-i  *  t^Ut  angle). 


778 


Circular  projection  of  the  workspace  for  Example 


WORKSPACE 


■ubwa \ w ’■■.->■ *-i<~ c-*  ^ J-  *-'  •- 


Table  1  A  comparison  of  the  Optimum  Path  Search  Technique  of  this 

investigation  with  the  Scanning  Technique  of  Lee  and  Yang  [13. 


Property 

The  Scanning  Technique 
--  Lee  and  Yang  [13 

The  Optimum  Path  Search 
—  This  investigation 

Simplicity  in 

Programmi ng 

Yes 

No 

Computational  time 
if  grid  is  doubled* 

2n 

2 

Information  about 
reconfiguration  of 
manipulator** 

No 

Yes 

Application  of 
analysis  over  a 
non-planar  surface 

Not  easy 

Easier 

Application  to 
secondary  workspace 

_ 

No 

Yes 

*  n  *  number  of  active  joints  to  describe  workspace 

**  It  is  possible  to  specify  a  path  through  the  workspace,  determining  if  the 
path  can  be  achieved,  and  if  at  any  point  reconfiguration  was  necessary  to 
continue. 


Table  2  Kinematic  parameters  for  the  3R  Manipulator 


RECURSIVE  GRADIENT  ESTIMATION  USING  SPLINES 
FOR  NAVIGATION  OF  AUTONOMOUS  VEHICLES 


C.  N.  Shea 
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Armament  Research  and  Development  Command 
Large  Caliber  Weapon  Systems  Laboratory 
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Watervlie: ,  NY  12189 


ABSTRACT .  Terrain  gradient  estimation  is  needed  for  navigation  of  an 
autonomous  vehicle  in  climbing  the  hills.  The  in-yath  and  cross-path  terrain 
slopes  are  estimated  from  the  set  of  corresponding  range  slopes.  A  two- 
dimensional  i  cursive  smoothing  algorithm  using  polynomial  splines  in  the 
third  dimension  is  developed  for  this  purpose.  Approximations  are  introduced 
in  the  sub-optimal  system  so  that  the  computation  time  increases  only  linearly 
with  the  size  of  the  two-dimensional  data. 


I.__  INTRODUCTION.  The  successful  development  of  an  autonomous  vision 
system  for  raooile  vehicles  would  be  of  considerable  value  and  importance  to 
defense  and  related  fields.  Numerous  reports  and  studies  currently  recommend 
artificial  intelligence/robotics  applications  which  require  autonomous 
vehicles.  Essential  to  these  robotic  vehicles  is  an  adequate  and  efficient 
computer  vision  system.  A  potentially  more  successful  approach,  other  than  TV 
pictures  and  photographs,  would  be  to  develop  a  three-dimensional  system 
employing  a  laser  rangefinder. 


A  range  matrix  describing  a  certain  scanned  area  of  the  terrain  in  front 
of  the  mobile  robot  can  be  used  [:o  estimate  the  slopes  of  the  terrain.  The 
ln-path  and  cros3-path  slopes  of  the  terrain  are  evaluated  by  a  slope 
estimation  scheme.  These  3lope  informations  along  the  passible  corridors  are 
utilized  to  determine  a  safer  and  more  accurate  path  for  the  mobile  robot 
vehicle  to  travel. 


The  mobile  robot  vehicle  is  equipped  with  data  acquisition  and  decision 
making  devices  for  its  autonomous  navigation  over  rough  terrain.  A  laser 
rangefinder  can  be  operated  by  emitting  laser  pulses  and  measuring  the  time  of 
flight  of  a  pulse  between  the  instant  it  was  transmitted  and  the  instant  the 
reflected  pulse  Is  received..  This  time  of  flight  is  related  to  the  distance 
between  the  transmitter  and  the  point  on  the  terrain  from  which  the  pulse  is 
reflected.  The  terrain  is  scanned  by  changing  the  azimuth  and  elevation 
angles  of  the  laser  bean  In  a  discrete  fashion.  The  measurements  are  then 
available  in  the  form  of  a  NXM  ’range-matrix'. 


*The  author  is  also  employed  by  Rensselaer  Polytechnic  Institute,  where 
he  holds  the  title  Professor  in  the  Electrical  Computer  anu  Systems  Engerineering 
Department . 
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The  slope  estimation  problem  dealt  with  in  thi9  section  is  that  of 
obtaining  smoothed  estimates  of  function  values  and  particularly  their 
derivatives  from  a  finite  set  of  inaccurate  measurements  in  two-dimensions. 

In  one  approach  we  can  identify  the  dynamic  equations  of  the  underlying 
system,  or  estimate  the  distributions  for  the  quantities  of  interest  and  then 
apply  optimal  estimation  algorithms.  In  some  engineering  problems  the 
stochastic  system  may  not  be  identified  easily  and  in  these  situations,  spline 
smoothing  has  proved  to  be  a  useful  alternative. 

In  this  paper,  we  obtain  the  smoothed  estimates  of  the  slopes  by 
utilizing  a  two-dimensional  smoothing  algorithm.  For  the  problem  of  smoothing 
a  finite  set  of  noise  corrupted  data  of  an  unknown  function,  it  is  proposed  to 
obtain  the  smoothed  estimate  by  fitting  a  two-dimensional  approximating 
function  to  the  data  set,  for  a  set  of  measurements  corrupted  by  a  white  noise 
process. 


II.  HISTORICAL  REVIEW.  By  noting  the  fact  that  original  signals  such  a3 
visual  scenes  are  in  analog  form,  techniques  were  developed  which  reconstruct 
analog  signals  from  discrete  data  by  utilizing  interpolation  or  approximating 
functions.  Frequency  domain  interpretation  of  the  interpolation  process  war 
reported  in  Reference  {1].  Also,  B-spiine  interpolates  [2-4]  were  used  [5]  in 
restoring  a  continuous  signal  from  a  set  of  digitized  data.  For  one¬ 
dimensional  noise  corrupted  data  generated  by  unknown  systems,  Reinsch  [6] 
utilized  natural  cubic  splines  [2-4]  along  with  least  squares  constraints  to 
solve  the  problem  of  curve  plotting.  Hou  and  Andrews  [7]  constructed 
continuous-discrete  image  and  utilized  spline  basis  functions  along  with  the 
least  squares  constraints  for  image  restoration.  Because  of  their  non¬ 
recursiveness,  the  algorithms  in  References  [6,7]  are  involved  with  complex 
computations  and  cannot  be  implemented  on  line.  Recently,  by  using  a 
reproducing  kernel  Hilbert  space  approach,  Weinert  [8,9]  et  al,  developed  a 
structural  correspondence  between  spline  interpolation  and  linear  least 
squares  smoothing  of  a  particular  random  process. 

In  recent  years,  two-dimensional  recursive  filters  have  drawn  much 
actention  because  of  the  need  for  processing  images  or  other  two-dimensional 
information.  Previous  efforts  [10-12]  to  achieve  a  truly  recursive  two- 
dimensional  filter  were  of  only  limited  success  because  of  the  difficulty  in 
establishing  a  suitable  two-dimensional  recursive  model  as  well  as  the  high 
dimension  of  the  resulting  matrix  and  state  vector.  Recently,  by  using  a  two- 
dimensional  recursive  model  obtained  from  a  two-dimensional  spectral 
factorization  technique  [13],  Woods  and  Radewan  [i4]  developed  a  two- 
dimensional  Kalman  vector  processor  and  a  two-dimensional  Kalman  scalar 
processor.  The  above  mentioned  time-domain  design  techniques  assumed  or 
identified  a  two-dimensional  stationary  discrete  system  model  at  the  beginning 
of  their  problem  formulation.  On  the  other  hand,  Reinsch  [6]  interpreted  a 
one-dimensional  data  smoothing  problem  as  an  optimal  curve-fitting  problem 
arising  in  approximation  theory  and  proposed  a  nonrecursive  smoothing 
algorithm  using  smoothing  splines.  For  a  two-dimensional  image  restoration 
problem,  Hou  and  Andrews  [7]  followed  the  approach  taken  by  Reinsch  [6],  and 
extended  it  to  a  two-dimensional  problem,  in  a  nonrecursive  manner.  On  this 
paper,  we  develop  a  two-dimensional  recursive  smoothing  algorithm.  Compared 
to  its  nonrecursive  counterpart,  this  recursive  algorithm  will  require  less 
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computational  complexity  and  memory  space.  Especially,  the  amount  of 
computation  needed  at  each  iteration  is  independent  of  the  size  of  the  two- 
dimensional  data. 

III.  PROBLEM  FORMULATION  FOR.  ONE-DIMENSIONAL  APPROXIMATION.  From  the 
viewpoint  of  approximation  theory,  when  a  set  of  discrete  observation  data  is 
noise  free,  spline  interpolation  provides  a  means  of  optimally  reconstructing 
an  unknown  original  signal.  When  the  observation  data  are  corrupted  by 
noises,  and  if  the  form  of  the  original  continuous  signal  is  known,  then  we 
can  use  least  squares  estimation  techniques  to  approximate  the  original 
signal.  In  this  paper,  we  are  dealing  with  a  problem  in  which  an  unknown 
signal  is  approximated  by  smoothing  splines  from  a  set  of  noise  corrupted 
observation  data.  Specifically,  an  unknown  signal  f(£)  is  approximated  by  a 
polynomial  spline  s(£)  which  minimizes  the  objective  function: 

N  N  Cn 

J*  -  l  [s(5n)  "  “nl^^sC^n)  -%]  +  {£  PnL  [sk(<i)]2dO  (D 
n=l  n=2  5n_1 

where 

mn  is  an  observation  data; 

®n  ■  f(5n)  +  vn»  for  n  -  1,  2,...,N; 

Vn  is  a  white  observation  noise  process  with  error  covariance  Rq} 

Rn  "  E{vn*vnT> 

Pn  >  0  is  a  smoothing  parameter;  and 

s*  is  the  ktk  derivative  of  s(5). 

At  this  point,  it  is  worthwhile  to  note  the  physical  role  of  the  smoothing 
parameter  pn  as  follows:  (a)  when  pn  becomes  very  small  pn  +  0*",  the 
resultant  approximating  function  will  pass  through  each  data  point  and  become 
an  interpolation  function;  (b)  when  pn  assumes  a  very  large  value,  Pn  °», 
minimization  of  the  objective  function  in  Eq.  (1)  corresponds  to  fitting  a 
straight  line  to  a  data  set  using  least  squares  criterion.  Thus,  it  can  be 
said  that  the  smoothing  parameter  controls  resolution  in  a  tradeoff  of  the 
smoothness  of  the  restored  function. 

A.  Choice  of  Approximating  Function.  As  has  been  noted,  it  is  desired 
to  develop  a  recursive  algorithm  whose  results  are  sufficiently  close  to  those 
obtained  by  directly  minimizing  the  global  problem  as  given  by  the  criterion 
in  Eq.  (1),  Fundamental  problems  encountered  in' developing  a  recursive 
algorithm  which  generates  approximating  functions  are: 

(1)  feasibility  of  recursive  structures, 

(2)  I'easibility  of  numerical  calculations. 

Regarding  the  first  problem,  it  has  been  noted  from  References  [1-5]  that  some 
of  the  approximating  functions  such  as  polynomial  splines  and  piecewise 
Herraite  polynomials  have  finite  support.  That  is,  a  resultant  approximating 
function  for  one  section  is  mostly  affected  by  its  neighboring  data  points. 
Thus,  a  recursive  structure  with  one  or  more  sample  delays  would  result  in 


sufficiently  close  results  to  nonrecursive  ones. 


For  the  second  point,  out  of  a  certain  set  of  functionals,  an  optimal 
solution  to  Eq.  (1)  is  an  L-spline  12-4].  L-spline  is  a  piecewise  polynomial 
of  degree  2k-l,  and  has  2k-2  continuous  derivatives  in  the  region  {$1,  %] . 
Here,  we  propose  to  restrict  our  approximating  functions  to  piecewise  Hermite 
polynomials  [3]  of  degree  2k-l,  which  have  k-1  continuous  derivatives  in  the 
region  [Ci,Cn1*  Advantages  in  using  piecewise  Hermite  polynomials  are  as 
follows.  Define 


xi  -  isa),  s’(o,...,sk-1u)i 


C-Ci 


i  ■  1 , . . .  ,N 


then  a  piecewise  Hermite  polynomial  s(£)  is  completely  determined  by  x^,  i  * 
1,2, .. . ,N.  For  the  purpose  of  clarity  in  discussion,  only  the  case  of  k-2  is 
treated  in  the  following.  A  piecewise  cubic  Hermite  polynomial  is  represented 
as: 

<  < 

81,2(0  for  Ci  -  C  -  C* 

•  • 

s(5)  *  .  .  (2) 

<  < 

»N-1,N(C)  for  £n-i  -  5  -  Cn 

where 

sk-l,k(5)  “  [*k,l<S>*k,l(C)4<k,0<C>  4k,0(C)]l*kT,  *k-lTJT  (2a> 

Xk  -  [s(Ck),s'Uk)]T,  k  -  1,...,N  (2b) 

<t>k,l(^  -  (5“Ck-i)2[(Ck-Ck-i)  +  2(5k-C)]/(Ck-Ck-1)3  (2c) 

’he,l(5)  m  < C-Ck-i) 2( C-Ck)/( Ck-Ck-l) 2  (2d) 

*k,0(5)  “  (Ck  ;)2[(Ck-Ck-i)  +  2(5- Ck-i ) ] / ( Ck“ Ck-i ) 3  <2e) 

and 

'he, 0(C)  -  (C-Ck-i)(Sk-C)2/(Ck-5k-l)2  (2f) 

B.  Smooth  Integral  As  Quadratics  at  Node  Points.  Thus,  it  becomes 
natural  that  the  smoothing  integral  in  Eq.  (1)  is  expressed  in  terms  of  Xj’s, 
i  *  1,2, .. .  ,N.  With  some  manipulations  in  algebra,  the  smoothing  integral  for 
k  *  2  in  Eq.  (1)  is  represented  in  a  quadratic  form  as  derived  in  Appendix  A. 


J*  |  |s"(C)|  |2  d5  -  (xn  -  A*xn-i)TB“1(xn-A*xn-i) 

Cn-1 


xn-1 

T 

Cll  C12 

xn-l 

xn 

C21  C22 

*n 

_ _  ___ 

(3) 
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where 


A* 


xn  =  [3(5),  s'( 5)]T 


1  A 
0  1 


A2 

•  T 

.  A 


(4) 


4  *  5n  -  5n-l,  for  n  *  2 . N 

A  .  4  A  A 

Cn  =  A*tB_1A*  ,  O12  “  02lT  -  -A*t  B"1,  C22  *  3-1 


(5a) 

(5b) 

(5c) 


A  iionrecurslve  solution  for  the  minimization  of  the  objective  function  in 
Eq.  (1)  can  be  obtained  by  taking  the  gradient  of  J*  with  respect  to  [xj_, 
X2,«**,Xfl]T  and  setting  it  to  zero.  However,  this  approach  will  require 
solutions  of  a  set  of  2N  simultaneous  equations.  To  avoid  this  computational 
problem,  we  developed  a  recursive  algorithm  which  requires  inversion  of  2  x  2 
matrices  only. 

A 

IV.  RECURSIVE  ALGORITHM.  Given  a  set  of  initial  values  for  the  mean  xi 
and  its  error  covariance  Pi,  where  Pi  »  E{(xi“Xi)(xi~xi)T}»  by  using  Eqs.  (3) 
and  (4)  the  objective  function  in  Eq.  (1)  becomes 

N  - 

%  *  I  [(Hxn-mn)TRn"1(Hxn-ma)]  +  (xi-xi)TPi"1(xi-xi) 
n=2 

N 

+  I  Pn(xn-A*xn-i)TB~ 1(xn-A*xn-i)  (&) 

n*2 

where  H  *  (1, 0) . 

A  A  /• 

Let  the  solutions  to  the  above  optimization  problem  be  [xi*,x2*, . . . ,xn*J • 

A 

If  Xp | q  is  defined  as  the  estimate  of  xp  obtained  by  minimizing  Eq.  (6)  with  N 

A  A 

*  q,  then  xj*  can  be  written  as  xjl  |  n*  Here,  it  is  proposed  to  approximate  a 

A  A 

nonrecursive  solution  X£*  *  x-^ | jq  by  xiji+ji,  where  A  ■  0  and  A  *  N.  As  has 
been  mentioned  before,  due  to  a  local  base  property  of  the  polynomial  splines, 

A 

the  smoothed  estimate  xi|i+£  would  be  sufficiently  close  to  the  nonrecursive 

A 

solution  xi*  for  A  =*  1,  2,  or  3. 
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A.  Filtering.  From  the  definition,  a  filtered  estimate  would  be 

obtained  by  minimizing  the  objective  function  in  Eq.  (6)  with  N  -  i.  By 
taking  the  gradient  of  Jt  with  respect  to  [xlt  x2,...,xi],  we  have  a  set  of 
simultaneous  equations  as  follows: 


PjC2l|+  PjM.Cn 


HTR-1 


Pk-2^22 

+  Pk-lCll 

Pk-lCl2 

+  hTr-1  H 

k-1 

1  ®k-l 
k“l 


HtR“ 1mv 

k 


It  should  be  noted  that  the  matrix  on  the  left  side  of  Eq.  (7)  is  diagonally 
dominant  and  positive  definite.  Here,  we  are  interested  in  solving  Eq.  (7) 
for  xi.  Thus,  by  eliminating  the  first  four  equations,  i.e.,  the  variable 

a 

Eq.  (/)  is  reduced  to  the  same  form  as  itself  with  j  “  2,  k  =■  i,  and  X2|2  and 
E2  are  calculated  by  Eqs.  (7c)  and  (7d).  Note  that  the  quantity  Ej  defined  by 

A  A 

Eq.  (7d)  is  called  pseudo  error  covariance,  because  P|c|k  «  Eixk“5nc.jk)(xk~xk|k)Ti 
cannot  be  computed  in  a  recursive  manner  directly.  ! 


By  applying  this  reduction  method  repeatedly,  the  original  equations  in 
Eq.  (7)  are  rsduced  to  the  same  form  as  itself  with  j  *  i-1,  k  **  1  and  every 


xi|j  and  Ej  are  computed  by  Eqs.  (7 
with  j  •-*  *“]  ,  1c  *  i  in  terms  of  x^, 


c)  and  (7d)  recursively.  Solving  Eq.  (7) 
we  obtain  a  recursive  estimate  algorithm 


as 


xi| i  13  ~  ^-lCziGi-r^i-i]  (8) 

Equation  (8)  is  rearranged  as 

A  A 

xi|i  “  +  FiXi-xJi-!  (9) 

where 


Fi  “  ~Pi-lEic2l(  Pi-lCn+E^i-  *)“  1  (9a) 

and  E^’s  are  computed  by  Eq.  (7d)  recursively.  Equation  (9)  above  is  the 

A 

desired  filtering  equation  which  computes  xjji  from  the  previous  estimate 

xi-l|i— 1  a°d  the  present  measurement  m^.  From  the  viewpoint  of  smoothing 
spline,  the  recursive  filtering  algorithm  can  be  interpreted  as  follows.  The 
estimate  of  obtained  by  fitting  cubic  splines  to  the  measurement  data  mn, 

A 

n  =  2,  3,...,  with  the  initial  values  xi  and  Ei  =  Pi  is  the  same  as  the 
estimate  of  x^  obtained  by  fitting  a  cubic  polynomial  to  the  measurement  m^ 

A 

with  the  initial  values  at  stage  i~ 1 ,  xj— l|i— 1,  and  Ej—i.  In  fact,  the  above 
interpretation  comes  from  the  mathematical  derivations  in  Eq.  (7)  through  Eq. 


With  reference  to  Eqs.  (7)  and  (8),  each  iteration  of  the  recursive 
filtering  algorithm  can  be  interpreted  as  fitting  a  cubic  polynomial  to  the 

A 

previous  estimate  xi-iji-i  and  the  present  measurement  ra£  in  the  region 

t^i-l.^i]. 


B.  Smoothing.  A  smoothed  estimate  xjl  i+jj,  is  defined  as  the  estimate  of 
obtained  by  solving  the  minimization  problem  in  Eq.  (6)  with  N  *  i+A.  In 
fact,  this  can  !e  interpreted  as  fitting  a  polynomial  spline  to  the  first  i+A 
data  and  obtaining  the  function  value  and  its  derivative  from  the 
approximating  function  at  the  node  i.  In  our  formulation,  this  corresponds  to 
solving  the  simultaneous  equations  of  the  same  form  as  Eq.  (7)  with  j  »  l,  k  • 
i+A  and  the  quantities  and  d^  are  defined  in  Eqs.  (7a)  and  (7b).  By  using 
the  same  reduction  method  as  before,  the  result  would  be  the  same  form  as 
itself  (Eq.  (7)  with  j  »  1  and  k  *  i+A). 

For  the  case  of  a  cne-sample  delay,  xxjx+x  is  obtained  by  solving  the 
simultaneous  equations  in  Eq.  (7)  with  j  *  i,  k  *  i+1,  and  which  yields 

A  A 

xi  j  i+1  “  ViE^x^  +  Ki«i+1 

where 

Vi  =■  [-PiCi2(PiC22  +  ttTRi+r^p^i  +  Gi]-1 

Ki  -  -Vip£Cl2(PiC22  +  HtRi+x"  %)**  “  X®i+1 

and  Ex  is  defined  as  before. 

Equation  (10)  is  the  desired  smoothing  algorithm f  in  which  the  smoothed 

*  A 

estimate  xx|x+l  is  obtained  by  updating  the  filtered  estimate  xxjx  with  the 
measurement  mi+i.  The  smoothing  procedure  described  above  implies  the 
following.  The  smoothed  estimate  of  x*  obtained  by  fitting  cubic  splines  to 

A 

n^,  n  *  2,  3S...,  i+A  with  the  initial  values  x^  and  E^  «  Pj  is  the  sama  as 
the  smoothed  estimate  of  x^  obtained  by  fitting  splines  to  n  -  i+1,..., i+A 

A 

with  the  filtered  estimate  Xxjx  and  Ex* 

A 

A  recursive  procedure  to  obtain  smoothed  estimate  Xx|x+i  is  summarized  as 
follows:  s 


(10) 

(10a) 

(1Gb) 


Part  1.  Obtain  the  filtered  estimates  xx|x,  i  »  2,...,N  by  using  Eqs. 
(7d),  (9),  and  (9a). 

Part  2.  Smoothed  estimates  Xxjx+x,  for  .1  »  1,...,N-1  are  obtained  by 
using  Eqs.  (10),  (10a),  and  (10b). 

A 

As  was  mentioned  earlier  in  this  section,  tfca  smoothed  estimate  xxjx+x  is 

an  approximation  to  the  nonrecursive  solution  xj*  -  xxjfl.  Thus,  as  the  number 
of  delays,  A,  increase,  we  will  get  a  better  approximation  to  xx*.  For  A  -  2, 
3,...,  only  the  smoothing  part  is  modified  by  solving  the  simultaneous 
equations  in  Eq.  (7)  with  k  «  i+A.  In  fact,  the  smoothing  algorithm  developed 
by  far  is  a  fixed-lag  smoothing  algorithm,  which  is  suitable  for  an  on-line 
implementation.  If  the  situation  does  not  require  an  on-line  implementation. 
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we  can  also  derive  a  fixed-interval  smoothing  algorithm  with  observation  set  M 
*  {mi, . . . .  In  this  case,  the  resultant  smoothed  estimates  become  exactly 
the  same  as  the  nonrecursive  estimates. 

V.  SIMULATION  RESULTS.  For  a  continuous  signal 

f(5)  -  sin (5)  (11) 

measurements  are  obtained  at  discrete  points: 

mi  *  f(Ci)  +  vi  ,  i  -  1,...,100  (12) 

where  v*  is  white  Gaussian  measurement  noise, 

Ri  =  Etviv/1/  -  0.000025,  and  -  5n  -  ^-l  *  2ir/100  -  0.062832  (13) 

Function  values  and  the  first  derivatives  at  discrete  nodes  are  estimated 
from  the  measurements  mj,  i  *  1,...,100  by  the  three  schemes  below: 

1.  Difference  quotients  method. 

2.  Recursive  smoothing  algorithm  with  A  *  1:  Eq.  (10). 

3.  Nonrecursive  smoothing  by  cubic  splinep  as  described  in  Reference 
16]. 

Table  1  show3  the  mean-square  errors  from  the  three  schemes  above. 


TABLE  1.  MEAN-SQUARE  ERRORS 


where 

1  -.00 

e0  *  -t;  I  U(5i)  -  Xiii+ld))2, 
100  1,1 

l  100 

n'wo  -ali+xW))2 


From  Table  1,  it  is  noted  that  both  smoothing  algorithms  are  successful 
in  reducing  the  error  in  the  estimated  states.  The  error  in  the  first 
derivative  is  decreased  by  more  than  10  db.  Moreover,  Table  1  shows  that  the 
performance  of  the  two  smoothing  schemes  are  comparable.  However,  it  should 
be  emphasized  that  the  recursive  algorithm  developed  in  this  paper  is  much 
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simpler  than  the  nonrecursive  spline  smoothing. 


VII.  PROBLEM  FORMULATION  FOR  TWO-DIMENSIONAL  APPROXIMATION.  When  the 
observation  data  are  noise  corrupted  and  the  underlying  system  is  unknown,  it 
is  proposed  to  approximate  the  original  signal  by  spline  functions  which 
minimize  a  certain  objective  function.  Thus,  from  a  set  of  discrete 
measurements  m^  j  corrupted  by  white  noise  process  V£  j 


rai,j  “  fUi.hj)  +  Vij 


1,  2, . . .  ,N,  j  a  1, 


the  original  two-dimensional  signal  f(5,n)  defined  in  the  region  of  (5,n)  is 
approximated  by  a  spline  function  s(5,n)  which  minimizes  the  following 
objective  function: 

N  N 

J  35  I  l  IsCCi.hj)  -  «i#j]TRiirl!*i(€i,nj)  -  mt>j] 

j=i  f.»l 


An 

+  Pi  f  J  z(5,n)d5dn] 
*ii  k 


where  p  >  0  is  the  smoothing  parameter;  R^j  is  the  observation  error 
covariance;  and  z(S,h)  is  a  certain  smoothness  measure  of  s(5,n)  at  (5,n). 


A.  Choice  of  an  Approximating  Function.  In  this  paper,  we  are 
interested  in  obtaining  smoothed  estimates  of  function  values  and  the  first 
derivatives  in  both  5  and  n  directions.  Here,  we  propose  to  restrict  our 
approximating  functions  to  piecewise  bicubic  Hermite  polynomials  which  have 
continuous  first  derivatives  in  both  5  and  h  directions. 


Define 


*i,j  m  ls(5,n)  ,  “  (5,n)  »  (5»h), 

35  3'1 


(5,n)jT  (C,n)  -  (5i,nj> 
353n  J 


for  i  3  1,  2,  ...,N  aud  j  ®  1,  Then,  a  piecewise  bicubic  Hermite 

polynomial  is  completely  defined  by  x^j,  i  »  1,  2,..*,N  and  j  **  1, 
as  follov/s:  <  < 

s(£,n)  *  si  j(5,n)  ,  for  5i  -  5  »  £4+1  (18) 

and 


r,j  *  n  a 


v  I  i  *?■  v  \  V  “  .  s 


TOTraprxrs. -,'OTrjrry  ■.v'-y'T*:v’j'r  v ».v  •* 'J  .«  ct  :>!' »v* i1 »  J*Hi.->  lv i.V~^-~?^-:l.'y\^-^.-<Tl^.~  ■•■'■ 


where 


1  1 

si, “II 

2« 0  a-0 


4>£(0*<fta(n) 
*i(  O  •  $qj(  n> 
h  ( 5)  *  'JfeC  n) 
*i(  5)-^(n) 


*  xi+£,  j+m 


(20) 


B.  Choice  of  the  Smoothness  Measure  z(g,n).  Here,  we  present  three 
examples  of  the  smoothness  measures  and  compare  their  physical  implications. 

1,  Gaussian  cucvature:  In  Reference  [15],  the  mean  curvature  of  a 
surface  at  (5,n)  is  defined  as: 


(0.5)V2s(5,n)  (21) 

Noting  the  Euler’s  theorem  [16]  that  the  sum  of  two  curvatures  in 
perpendicular  directions  at  a  point  is  constant,  the  square  of  V2s(£,n)  in  Eq. 
(21)  would  be  a  reasonable  measure  for  the  smoothness  of  s  surface: 

a2  a2 

*(M>  "  1  Ji 3  9a»f,)  +  ^2  8<^*n>l  {22> 

2.  A  variation  from  the  Gaussian  curvature:  Witn  reference  to  Eq.  (22), 
an  interesting  case  occurs  when  the  two  principal  curvatures  are  equal  and  of 
opposite  sign.  The  mean  curvature  in  this  case  is  zero.  This  is  the 
so-called  "saddle  point"  and  every  surface  element  of  such  a  membrane  is  "pure 
twist."  An  appropriate  smoothness  measure  would  be  changed  to: 

32  32 

2<5,n)  -  t“^2  eU,n)]2  +  [-*2  s($,n)]2  (23) 

3.  In  Reference  [7],  llou  and  Andrews  suggested  to  use  j  j  V**s(  5,  n)  1 1 2  as  a 
smoothness  measure  for  a  surface.  The  physios*,  interpretation  of  the  quantity 
V4  s(5,n)  is  found  in  a  plate  bending  theory  [ioj;  an  unloaded  plate  can  bend 
only  in  a  biharmonic  function  w  where 

74u  -  0  (24) 

The  one  which  minimizes  the  objective  function  for  bicubic  Herraite  polynomials 
is  given  in  Appendix  B. 


c.  Smoothing  Integral.  Now,  it  ia  needed  to  determine  the  functioi 
which” minimises  the  objective  function  J  in  Eq.  (16).  It  is  note* 
the  smoothing  integral  in  its  present  form  gives  difficulties  in  finding 
explicit  solution.  By  evaluating  the  integrals  of  the  derivatives  of  ba 
functions  and  applying  some  algebraic  manipulations,  thene  smoothing  int 
are  converted  to  quadratic  forms  as  follows: 


nM  eN  M-l  N-l  Hj+i  $1+1 

p  !  /  [z(5,n)]dWn  «  ).  If  I  [z(£,n)]d<;dn 
ni  ei  j-l  hj 


k  *»  2,...,N,  and  so  on.  First,  we  will  discuss  a  filtering  procedure  for 

A 

,:k.2  |k,2>  k  =  2,  Then,  this  procedure  is  extended  for  the  filtered 

A 

estimates  for  %  =  2,  3,..-,M. 

A 

By  definition,  the  filtered  estimate  <k,2jk,2  *s  the  estiraa?  ■;  .  i 
obtained  from  an  approximating  function  wh^ch  minimizes  the  obje-  .,e  function 
in  Eq.  (27)  for  (p,q)  =  (k,2).  For  minimization,  we  take  the  gradient  of 
J(k,2)  with  respect  to  and  X2,  and  sat  it  to  zero 

VUl)  J(k,2)  "  0 

252 

where 

2Lj  =  x2,jT»*“.Xk,jT]T  (28) 

We  can  obtain  a  final  recursive  estimation  equation  as: 

A  A  A 

/*k,l|k,2\  /Px,l_1  *k,l  \  /*k-l,l|k-l,2\ 

r  Ek’2  U  -l  )  +  Fk*2  (  “  1  (29) 

'  xk,2|k,2  '  '«TRk,2  *k,2'  '  *k-l,l|k-l,2  / 

For  notation  used  in  the  above  equation,  see  reference  (17] .  With  reference 
to  the  final  estimation  equation  above,  it  is  noted  that  for  the  filtered 

»  •  A 

estimate  xj^lk^  the  scheme  uses  the  previous  estimates  x^-i ,1  |k-l ,2» 

and  ,2|k-l,2>  and  the  measurement  Here,  it  should  be  emphasized 

A  A 

that  the  scheme  uses  the  smoothed  estimate  xjj-l ,1 |k-l ,2  instead  of  Xk-1,1. 

Thus,  when  the  filtered  estimate  xj^lk  2  i®  computed,  it  is  needed  to  update 

the  estimate  xj^i  to  iq^ilk^  for  Jee  the  next  iteration.  Now,  by  using 

the  recursive  estimation  equation  in  Eq.  (28)  and  the  pseudo  error  covariance 

equation  in  Reference  [17],  we  can  compute  xj^jk^*  k  “  2....,N  recursively. 

A 

The  resultant  recursive  filtering  equation  for  becomes  similar 

A 

to  the  one  in  Eq.  (28).  Also,  the  smoothing  equation  for  becomes 

similar  to  the  one  for  After  all  the  estimates  in  the  third 

A  A 

column,  x*i>3j^t>3  and  x*;i> 3|k+l,3  k  »  1,  2,...,N  are  obtained,  the 

A 

smoothed  estimates  k  ■  1 , . . . ,N-1 ,  will  be  used,  for  the  estimates 

A  A 

in  the  fourth  column,  3C*it,A j k.,4  and  x*k,4|kfl,4>  and  ®°  on. 
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The  approximation  method  described  above  is  one  of  the  simplest  ones.  We 
can  employ  more  elaborate  approximation  methods  at  the  cost  of  more 
complicated  computations.  By  now,  we  have  introduced  a  recursive  quarter- 

a 

plane  processor  which  computes  the  filtered  estimate  x*^ as  an 

A  A 

approximation  to  |k,£s  the  definition  of  the  estimate  *k,Ajp,q»  t^ie 

A  A 

value  of  which  minimises  J  is  Here,  we  note  that  x*]^ i  has 

A 

its  support  in  the  region  R(h,£)»  while  the  nonrecursive  solution 

has  its  support  in  the  region  R^ni*  Thus  if  we  desire  to  have  a  better 


approximation  to  It  is  needed  to  develop  a  smoothing  algorithm  which 

>  <  < 

computes  %  >  I  ^+4  ,£+d„  where  dp  ,dv  =  i,  k+di  »  N  and  l+d2  =  M. 

'  12* 

The  smoothed  estimate  f £+<j  is  defined  as  the  estimate  of 

*  1*2  * 


obtained  by  fitting  an  approximating  function  in  the  region  R(k-fd  &+d  ) *  It 
has  been  derived  that  can  be  reasonably  approximated  by  xfc+d  jA+dj 

where  xjc+a^jt+d  I®  obtained  by  fitting  an  approximating  function  to  a  smaller 


region.  The  derivation  procedure  for  the  above  approximation  is  similar  to 
that  of  filtering  discussed  previously,  and  is  omitted  for  conciseness. 


IX.  FURTHER  NAVIGATION  PROBLEMS. 

A.  Terrain  Slones  and  Range  Slopes.  With  reference  to  Figure  1,  terrain 
in-path  and  cross-path  slopes  are  defined  as  the  two  orthogonal  slopes  dz/dp 
and  dz/pd3  in  a  cylindrical  coordinate  system.  During  the  past 
investigations,  the  terrain  slopes  were  found  to  be  appropriate  measures  for 
evaluating  a  terrain-  A  direcc  approach  for  estimating  the  terrain  slopes 
would  be  to  fit  a  smoothing  spline  to  the  measurement  data  in  cylindrical 
coordinates.  However,  there  is  a  major  difficulty  in  this  approach.  Even 
though  the  two  independent  variables  and  for  the  rangefinder  are 
changing  with  cons tan.  increments  Afi  and  A0,  respectively,  the  independent 
variable  p;  in  a  cylindrical  coordinate  changes  irregularly.  The  recursive 
smoothing  algcritnm  in  the.  previous  subsection  requires  that  the  data  points 
be  located  at  the  corners  of  rectangular  grids  of  the  two  independent 
variables.  Since  the  two  independent  variable*;  p$  and  Sj  in  a  cylindrical 
coordinate  system  do  not  form  rectangular  grido,  the  smoothing  algorithm* 
cannot  be  applied  directly.  By  noting  that  the  positioning  angles  and  9j 
ere  changing  in  regular  fashion,  it  is  proposed  to  obtain  the  mionthed 
estimates  of  the  range  slopes  dr/dd  and  dr/dO  defined  in  spherical 
coordinates.  Then,  these  estimates  are  transformed  to  the  terrain  slopes.  In 
appljin.g  the  smoothing  algorithm  to  terrain  slope  estimation,  one  point  to  ,be 
mentioned  is  that  the  basic  philosophy  of  the  smoothing  spline  approach  is  to 
suppress  the  noise  elements  by  fitting  a  smooth  approximating  function  to  a 
noise  corrupted  data  set.  Thus,  when  the  function  to  be  approximated  has 
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sharp  changes  in  its  values  or  derivatives,  the  smoothing  algorithm  will 
produce  errors  in  the  results  by  smoothing  out  these  actual  sharp  changes. 

From  the  viewpoint  of  terrain  slope  estimation,  such  changes  occur  at  the 
edges  of  a  boulder,  a  crater,  or  a  ridge  on  the  terrain.  Thus,  it  is  proposed 
to  detect  these  edges  by  using  the  rapid  estimation  scheme.  Then,  for  the 
area  which  is  free  of  discrete  edges,  the  two-dimensional  smoothing  algorithm 
is  utilized  to  estimate  the  slopes.  The  terrain  slopes  are  estimated  in  the 
order:  (1)  discrete  edges  are  detected  by  using  the  rapid  estimation  scheme; 
(2)  for  the  area  which  is  free  of  discrete  edges,  a  two-dimensional  smoothing 
algorithm  is  utilised  to  estimate  the  range  slopes;  (3)  estimated  range  slopes 
are  transformed  into  terrain  slopes. 

B.  Estimated  Terrain  In-Path  Slope.  The  simulation  of  terrain  with 
hills  and  valleys  is  given  in  Figure  2.  The  estimated  terrain  in-path  slopes 
[18]  are  displayed  in  terms  of  a  elope  map,  Figure  2.  Characters  A, ...,G 
represent  a  particular  range  of  the  terrain  in-path  slopes  increasing  from  A 
to  G,  at  the  corresponding  location.  U  represents  undefined  slopes.  In 
Figure  3,  we  note  circular  slope  regions  on  the  faces  of  sinusoidal  hills  and 
valleys.  Also,  along  a  radial  direction,  the  estimated  slopes  are  changing 
slowly  from  one  region  of  slopes  to  another.  The  large  empty  spaces  are  due 
to  the  hidden  regions  at  the  back  of  boulders  or  hills  where  laser  rays  could 
not  reach.  The  undefined  gradient  represented  by  'U'  occurs  when  the 
recursive  algorithm  cannot  be  applied  due  to  sharp  changes  in  ranges  between 
adjacent  measurement  data.  The  estimated  in-path  terrain  slope  maps  are  used 
for  the  evaluation  of  the  terrain  in  front  of  the  mobile  robot  vehicle. 

C.  Terrain  Cross-Path  Slopes.  In  discussing  the  terrain  cross-path 
slopes  [19],  the  data  can  be  conveniently  processed  to  generate  smoothed 
in-path  and  cross-path  range  slopes  recursively  in  spherical  coordinate  system 
due  to  the  regularity  of  the  elevation  and  azimuth  angles.  When  we  proceed  to 
calculate  the  true  terrain  slopes  on  the  base  plane,  the  regularity  of  the 
data  points  are  completely  destroyed.  For  a  fixed  elevation  angle  8,  the 
horizontal  projection  of  the  range  data  are  not  located  at  a  fixed  distance 
from  the  rover.  It  is  desired  to  calculate  the  cross-path  slope  at  point 
($i,0j).  However,  in  general,  Pi  j  ^  Pi, jH*  Thus,  the  true  cross-path  slope 
is  not  along  an  arc  connecting  points  Pj^j  and  Pi>j+i» 

Our  algorithm  to  calculate  the  terrain  cross-path  slope  can  then  be. 
summarized  as  follows: 

1.  Obtain  the  range  measurements. 

2.  Use  the  smoothing  algorithm  to  calculate  the  range  cross-path  slope. 

3.  Obtain  the  terrain  cross-path  slope  and  its  variance. 

D.  Evaluation  of  Terrain  Variables.  As  mentioned  in  the  introduction, 
in  the  evaluation  of  terrain  variables,  we  only  use  the  slopes  at  the  spine 
and  track  points.  The  reason  is  two-fold.  First,  only  a  minor  part  of  this 
path  selection  scheme  need  the  data  of  elevation.  Second,  if  we  adopt  the 
elevation  estimates  as  our  input  data,  we  will  get  larger  errors  in  the 
calculation  of  -in-path  and  tilt  slope  terrain  variables. 


The  terrain  variables  and  the  variances  together  with  the  corresponding 
explanations  are  listed  below  [20] . 

1.  In-Path  Terrain  Variables  The  in-path  slope  terrain  variable  gives 
the  average  of  the  in-path  slopes  for  the  four  vehicle  wheels  at  each  section. 
This  variable  is  a  measure  of  the  risk  in  the  forward  direction. 

2.  Tilt  Slope.  Tilt  slope  terrain  variable  is  used  to  estimate 
excessive  cross-path  slopes  which  may  cause  the  vehicle  to  tip  over. 

3.  Obstruction  Height.  The  obstruction  height  is  calculated  for  six 
different  locations  at  each  discrete  section  of  the  terrain.  The  maximum 
value  is  then  chosen  as  representative  of  this  whole  section. 

In  deriving  the  formula  for  a  typical  obstruction  height,  we  use  a  third 
order  polynomial  to  approximate  the  terrain  elevation  in  each  location.  By 
differentiating  this  polynomial  with  respect  to  the  distance,  we  get  an 
expression  for  the  slope.  With  the  known  data  of  slopes  at  the  three  points 
substituted  into  this  expression,  we  can  determine  the  coefficients  of  the 
polynomial.  Using  this  polynomial,  we  can  then  find  the'  obstruction  height  in 
this  direction. 

4.  Wheel  Deviation.  The  wheel  deviation  variable  describes  the  offset 
of  any  of  the  four  wheels  from  a  plane.  Wheels  on  any  three  track  poiuts 
define  a  plane.  For  each  combination  of  three  wheels  touching  the  terrain, 
the  deviation  of  the  fourth  wheel  with  respect  to  this  plane  is  defined  as  the 
wheel  deviation. 

A  set  of  the  measurement  data  is  obtained  by  the  described  scanning 
scheme.  The  range  measurement  data  are  processed  by  gradient  estimation 
scheme  to  evaluate  in-path  and  cross-path  slopes  at  the  data  points.  Since 
the  slopes  are  estimated  in  the  spherical  coordinate  system,  it  is  needed  to 
transform  the  range  slope  in  spherical  coordinate  system  to  terrain  slope  in 
cylindrical  coordinate.  The  in-path  and  cro3S-path  slopes  and  their 
covariances  at  the  spline  and  track  points  along  the  corridors  are  evaluated 
by  applying  two-dimensional  interpolation  scheme  over  the  estimated  slopes  at 
the  data  points.  The  terrain  variables  at  a  discrete  section  along  each 
corridor  are  computed  by  using  estimated  slopes  at  the  spline  and  track 
points.  Since  the  terrain  variable  estimates  have  uncertainty,  the  present 
method  increases  the  reliability  by  considering  standard  deviation  as  well  as 
their  mean  values. 

X.  CONCLUSION.  By  taking  an  algebraic  approach,  a  recursive  smoothing 
algorithm  was  developed  as  an  approximation  to  nonrecursive  spline  smoothing. 
Compared  to  the  recursive  smoothing  algorithm  suggested  by  Weinert,  the 
smoothing  algorithm  in  this  paper  is  simpler  in  that  the  scheme  is  in  a 
discrete  form.  Simulation  result  shows  that  the  performance  of  the  recursive 
smoothing  algorithm  is  comparable  to  that  of  its  nonrecursive  counterpart.  In 
addition,  the  computational  complexity  with  recursive  smoothing  algorithm  is 
much  less  than  its  nonrecursive  one.  Also,  recursive  smoothing  by  splines  can 
be  implemented  on-line.  By  taking  an  algebraic  approach,  a  two-dimensional 
recursive  smoothing  algorithm  was  developed  as  an  approximation  to  a 
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nonrecursive  smoothing  spline  technique.  While  the  amount  of  computation 
required  for  a  nonrccursive  algorithm  increases  rapidly  with  the  size  of  the 
two-dimensional  data,  the  amount  of  computation  for  this  smoothing  algorithm 
increases  only  linearly. 
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By  utilizing  Eq.  (A-5),  Eq.  (A-3)  becomes 
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Thus,  smoothing  integral  is  obtained  as: 
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By  defining  B"1  and  A*  as  below: 
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Equation  (A-7)  is  rewritten  in  the  following  form  as: 
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which  is  Eq.  (3)  in  the  text. 
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APPENDIX  B 


In  this  Appendix,  we  show  that  a  piecewise  bicubic  Hermite  polynomial 
which  minimizes  the  objective  function  in  Eq.  (B-l)  becomes  a  bicubic  spline. 


J  -  Jg  +  pJs 


(B-l) 


where 
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Let  a  set  S  be  a  collection  of  all  piecewise  bicubic  Hermite  polynomials. 
Also,  we  define  a  set  U  as  a  collection  of  all  piecewise  bicubic  Hermite 
polynomials  which  satisfy  constraints  sat  D  in  Eq.  (B-4)'. 


s(5i,nj)  -  c(i,j) 

,  i  *  1,2 . N  and  j  w  1,2, 

3s(5i,nj)/3C  -  cc(i,j) 

,  j  -  1,2, • • « ,M  and  i  -  1,N 

3s(Ci,rij)/3n  =  cn(i,j) 

,  i  *  1,2,...,N  and  j  »  1,M 

32s(Ci,nj)/353n  =*  C5>n(i,j) 

,  i  »  1,N  and  j  *  l,M 

Then  the  minimizing  problem  in  Eq.  (A-l)  is  rewritten  as: 


(B-4) 


Min  J  =*  Min  [Jg  +  pJs]  *  Min  [Jg  +  p  Min  J3] 

s(S,n)es  s(5,n)es  D  si^rOeU 


(B-5) 


In  the  paper  by  DeBoor  [21],  it  is  noted  that  there  exists  a  unique 
bicubic  spline  g(5,n)  in  the  set  U.  Also,  by  using  a  standard  technique  to 
derive  the  minimum  norm  property  [2]  of  a  bicubic  spline,  it  can  be  shown 
that: 
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(B-6) 


Since  the  bicubic  spline  g(5,h)  is  unique,  we  have  the  following  Lemma: 

Lemma  1:  A  bicubic  Hermite  polynomial  s(5,n)eU  which  minimizes  the 
smoothing  integral  Js,  becomes  a  bicubic  spline  g(5,n). 

With  reference  to  Eq.  (B-5)  and  Lemma  1,  we  conclude  that  a  piecewise  bicubic 
Hermite  polynomial  s(5,n),  which  minimises  Eq.  (B-5),,  becomes  a  cubic  spline. 
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ON  PHASE  TRANSITIONS  WITH  INTERFACIAL  ENERGY 


* 

Morton  E.  Gurtin 
Department  of  Mathematics 
Carnegie-Mellon  University 
Pittsburgh,  PA  15213 

Consider  a  fluid  wh-sch  has  free  energy  ip ( p )  a  prescribed  function  of 
density  p,  and  which  occupies  a  fixed  container  ft,  with  ft  a  bounded 
region  in  Ir.  If  we  neglect  all  other  contributions  to  the  free  energy,  then 
the  total  energy  EQ(p)  corresponding  to  density  distribution  p(x), 
x  e  ft,  is 

E0(P)  =  /jj  \j>(p(x)  )dx  •  (1.1) 

If  the  fluid  in  ft  has  mass  m,  and  fluid  is  neither  added  to  -  nor 
removed  from  -  ft,  then  the  allowable  density  distributions  must  be  consistent 
with  the  constraint 

/q  p(x)dx  =  m  .  (1.2) 

Following  Gibbs,  we  postulate  that  the  stable  configurations  of  the  fluid 
are  those  which  minimize  (1.1)  subject  to  (i.2).  Thus  we  are  led  to  the 
problem: 

r  minimize  the  energy  Eg ( p )  over 
PQ  <  all  sufficiently  regular  fields  p 
L  that  satisfy  the  constraint  (1.2) 

here  we  shall  be  concerned  with  situations  in  which  i|»(p)  is  nonconvex , 

of  a  form  capable  of  supporting  two  phases  (Figure  1).  In  this  instance  the 

solution  pQ  is  most  easily  described  in  terms  of  the  Maxwell  parameters 

a  ,  3  and  u  defined  by  the  conditions 
0  0  0 

Present  address:  Mathematics  Research  Center,  University  of  Wisconsin- 
Madison,  Madison,  WI  53705. 
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The  line  through  (a^,  4»{a^))  and  (3^,  ♦(Pg))  form*  the  convex  envelope  of 
the  free  energy  between  aQ  and  8g  (Figure  1). 


Figure  1*  Free  energy  {'(p). 

It  is  not  difficult  to  show  that  if  we  restrict  our  attention  to 
continuous  P,  then  PQ  has  no  solution  for  <  r  <  If  we  take 

piecewise  continuous  p,  and  assume  that  the  length  scale  is  chosen  with 

volume (Q)  «  1  , 

then  the  solution  of  Pg  is  as  follows i 
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(i)  for  m  <  Oq  or  m  >  8q  there  is  exactly  one  solution,  the 
constant  field 

P(x)  =  m  ; 

(ii)  for  m  e  (ct^,^)  any  field  p  of  the  form 

fv  ,te2« 

P(x)  =  < 

L  '  x  e  ' 

Q  u  ft2/  fl  ,8  C  n 
0O  "  “ 

volume (fl  )  =  * - —  , 

1  0O  “  “o 
m  -  a© 

volume  (8  }  =  - - — 

p0  "  0 

is  a  solution;  and  all  solutions  have  this  form. 

Thus  for  each  m  2  (a0,3Q)  there  are  infinitely  many  solutions,  some 
corresponding  to  sets  8^  which  are  quite  wild*  This  drastic  loss  of 
uniqueness  occurs  because  interfaces  -  jumps  in  density  -  are  allowed  to  form 
without  a  corresponding  increase  in  energy. 

A  theory  which  attempts  to  overcome  this  difficulty  was  developed 
independently  by  van  der  Waals  [1]  and  Cahn  and  Hilliard  [2]  and  is  based  on 
an  energy  of  the  form 

/jjC'Mp(x)  )  +  e  jVp(x)  |2]  dx  . 

Thus  jumps  in  density  are  not  allowed,  but  rapid  changes  are,  and  such  changes 
are  penalized  in  energy  by  the  presence  of  the  terra  jVp |  . 

One  problem  with  this  theory  is  its  difficulty,  especially  in  space- 
dimensions  larger  than  one.  A  second  problem  is  that,  because  solutions  are 
smooth,  it  is  not  a  simple  matter  to  locate  -  or  even  define  -  an  interfacial 
zone  between  phases. 
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An  alternative  theory  for  problems  of  this  type  was  developed  in  [3]  and 
differs  from  its  predecessors  in  two  respects: 

( 1 )  Jumps  in  density  are  allowed,  but  are  accompanied  by  an  interfacial 


energy. 

(2)  The  density  distribution  is  not  allowed  to  lie  in  the  spinodal 
region  [p1  ,p?] .  It  is  assumed  that  <Ji(P)  is  defined  and  locally  convex  on 
the  intervals  (0,^)  and  (p2,®),  with  nothing  said  about  the  behavior  of 
on  [p, ,p23 . 

For  fields  p  consistent  with  (2)  we  define  complementary  subsets 


n^p)  =  {x  e  £J:  p(x)  <  p^  , 
«2(P)  =  (x  (3  Si:  p(x)  >  c2>  , 


(1.4) 


S2^(p )  being  the  region  in  which  the  fluid  is  in  phase  i.  The  surface 

r(p)  *  asi^p)  n  3fl2(p) 

then  represents  the  interface  between  phases,  and  we  endow  T (p )  with  inter- 
facial  energy 


Ol(p)  , 


where 


(i)  I(p)  is  the  area  of  T(p)  , 

(ii)  o  (assumed  small)  is  the  interfacial  energy  per  unit  area. 
Me  are  therefore  led  to  an  energy  of  the  form 

E(p)  *  if,  <MP(x))dx  +  dl(p)  , 
and  to  the  following  problem; 


minimize  the  energy  E(p)  over  all 
sufficiently  regular  fields  p  which  have 
range  outside  [P1,P21  and  which 
satisfy  the  constraint  (1.2)  . 
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We  establish,  in  [3],  an  existence  theorem  for  problem  P  and  shot*  that 
solutions  p{x)  are  piecewise  constant  with  constant  chemical  potential 
U  *  'i>'(p(x))t  thereby  reducing  P  to  finding  the  global  minimum  of  the 
energy  expressed  as  a  function  of  y.  We  show  further  that,  when  the  solution 

is  considered  a  function  of  the  parameter  m,  with  u  the  corresponding 

n 

.chemical  potential  and  S  the  .associated  minitual  energy, 

m 


We  also  prove  that  solutions  p  of  P  have  minimal  interface;  more 
precisely,  we  show  that  r(P)  has  minimal  area  when  compared  to  all  '  hher 
interfaces 

^  <V  A/  \ 

r  -  3«1  n  3C2 

with 

voluraetO^)  *  volume (p ) )  . 

Also  contained  in  [3]  is  the  general  solution  of  P  in  R1,  as  well  as 
the  solution  for  0  the  unit  square  in  R2  with  *(p)  piecewise  quadratic. 
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ABSTRACT.  Tliis  report  is  about  computational  methods  in  the  detection  ol 
unknown  substances  by  meant;  o(  their  spectra.  The  method  ol  least -squares  is 
used  to  test  whether  an  unknown  spectrum  is  one  ci  a  previously-stored  set  of 
spectra  of  compounds  known  .0  bo  of  in* crest.  The  requirements  arc  that  this 
be  done  with  as  small  and  simple  a  program  as  possible  and  still  get  reasonable 
certainty  ol  deice*  ion.  The  programs  are  first  described,  then  tested  by 
simulation.  The  results  suggest  that  the  procedure  is  feasible  all  hough 
lu>'ther  improvement  is  probably  desirable, 

!.  INTRODUCTION.  This  report  is  ahmii  computational  methods  in  the  detection 
ol  unknown  substances  Ly  means  ol  their  spectra.  The  intent  ol  t he  work  is  to 
devise  a  simple,  rapid  system  lor  this  purpose.  Conceptually,  the  system  might 
consist  ol  a  mass  spectrometer  wltiih  produces  spectra  of  air  samples  and  I  eeds 
t  Item  to  a  computer  that  compares  them  witli  a  reference  seL  ol  spectra.  The 
computer  then  decides  whether  the  spectrum  is  that  of  a  substance  of  interest 
(th.iL  is,  a  member  of  the  reference  set)  or  not. 

An  enormous  amount  of  work  has  been  done  in  the  last  twenty  years  on 
methods  for  identification  of  unknown  chemical  compounds.  Many  different 
experimental  tools  are  available  (mass  spectrometry,  gas  chromatography, 
infrared  spectroscopy,  nuclear-magnetic  resonance,  etc.),  most  of  which  produce 
,1  spectrum  of  some  kind.  The  spectrum  is  compared  co  a  library  of  spectra, 
using  a  variety  of  mathematical  procedures,  loosely  termed  "pattern 
recognition"  or  "cluster  analysis",  frequent  1 y ,  the  final  decision  is  made  by 
a  highly  skilled  scientist  examining  a  number  of  sophisticated,  visual 
representations  ol  the  unknown  spectrum  and  the  reference  spectra.  A  typical 
example  ot  t  lu-  comput at  iona I  effort  involved  is  ARTHUR,  a  collection  of 
programs  for  chemical  data  manipulation  [ 1 J . 

While  this  and  similar  large  statistical  collections  art:  of  great  value  in 
a  laboratory  setting,  our  concerns  here  are  different  in  important  respects. 
Primarily,  our  procedure  must  be  simple;  it  cannot  rely  on  a  skilled  observer, 
nor  can  we  employ  massive  computing  power  and  elaborate  visual  displays. 
Presumably,  modest  compulatiou.il  capability  will  be  available.  We  must  expect 
substantial  noise  in  the  spectrum  since  st-'.:t  laboratory  procedures  cannot  be 
followed.  To  some  extent  these  burdens  arc  rfset  by  the  fact  that  we  are 
interested  only  in  a  spec i lie,  small  set  ol  _hslances. 
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In  ihc  iol lowing  sections  wo  sli.ill  describe  the  general  features  that  wc 
desire  for  this  system  and  the  mat  hem.it  i  cal  and  statistical  questions  implied 
by  tltem.  The  system  requires  various  computing  programs,  which  are  listed  and 
described.  Also,  simulations  ot  system's  operation  are  presented  and 

discussed. 

2 .  GENERAL  ASPECTS .  The  requirements  on  the  system  imply  that  it  must  be 

simple  but  capable  of  dealing  witb  noise.  The  limitation  on  computing  power 
and  the  absence  oi  skillful  users  almost  surely  rule  out  the  melhe-'s  of 
pat tern- recognition  and  cluster  analysis.  We  would  like  to  use  highly 
developed,  efficient  computer  programs  so  as  to  reduce  the  calculaL ional 
burdens.  These  considerations  suggest  that  we  adopt  least-squares  reg.ession 
as  the  basic  method  ior  detection. 

The  main  body  ot  this  report  concerns  questions  that  arise  in  carrying  out 
the  detection  by  least-squares.  There  are  many  variants  of  the  least-squares 
method,  many  programs  ior  carrying  out  the  computations  and  (unfortunately) 
many  pitfalls  in  understanding  Lite  results.  Wc  deal  with  these  in  Lhe 
following  sections  but  wish  now  to  state  'he  problem  in  the  mathematical 
notation  that  will  be  used. 

The  mass  spectrum  ol  a  substance  is  represented  by  the  vector  of  spectral 
intensities  at  .1  suitably  chosen  set  ol  mass  ratios.  For  example,  the  unknown 
substance  with  intensities  y^,  yi....,  at  mass  ratios  indexed  by  1,  2,  .... 
N,  respectively,  are  viewed  as  the  vector  y, 


T 

y  =  [yjt  y2»  yN]  -  transpose  of  y 


Each  of  the  P  reierence  substances  is  also  represented  by  its  vector  of 
spectral  intensities  at  the  same  mass  ratios.  The  vector  X  is  given  by 

xjT-  >V  V  x3j . xnj  1  J  ’  . . . . 

In  the  least  squares  method,  we  assume 


J  -  1 


X,3.  +  c 

j  J 


where  B.  are  coot f icient s  to  be  estimated  and e  is  assumed  to  be  a  vector  of  N 
J 

independent,  Gaussian  random-variables  with  mean  =  0  and  variance  ---  0*. 


The  coefficient  vector. 


B  -  iBj  ,  B2 . B  1 


is  estimated  by  minimizing  e  e.,  which  leads  to  the  loriuula  lor  B  (the  estimate 
of  B> 

«*  T  -IT 

B  -  (X  X)  Ty, 


'  Reproduced  from 
1  ini  available  copy. 


816 


whore  the  matrix  X  has  N  rows,  1’  columns  and  is  given  in  terms  of  its  column 
vectors,  X^,  by 

x  -  lx,  .  .  xpJ. 

The  quantity  oz  is  estimated  by 

S2  *  eTe/(N-P) 

and  the  covariance  matrix  of  B  is  estimated  by 

COV  (11)  »  S2  (XTX)_1 

Finally,  the  estimated  Student-t  of  the  j-th  coefficient  is 

t.“  B./[Var(B)  ,]l/«  U ,/S[ (XTX)"! . J 1 /2 
J  J  I  .1  JJ 

1 1„  t  he  mast,  sped  route  t  or  present:,  tt:.  with  a  spec  L  rum  ol  an  unknown 
substance,  represented  by  the  vector  y,  the  algorithm  consists  of  calculating 
the  t  j  and  comparing  them  with  a  value  'v  2.  If  all  L'j  >  tc,  there  is  little 
reason  lo  think  that  the  unknown  is  any  ol  the  reference  substances.  If 
tj  -  tc,  there  Is  good  reason  to  think  that  the  unknown  contains  a  substantial 
fraction  of  the  j-t  It  re  I  ere  nee  subst.iiue. 

We  shall  discuss  this  procedure  in  the  next  secLion. 

3.  DESCRIPTION  OF  THE  PKOCEIHIHK.  The  procedure  sketched  in  the  previous 
section  for  detecting  the  reference  substances  can  be  carried  out  in  many 
different  ways.  There  are  several  perils  that  attend  Lhe  least-squares  method, 
and  many  dillerent  methods  have  been  developed  to  deal  with  them. 

Perhaps  the  most  interesting  aspect  ol  the  present  detection  problem  is 
the  possibility  that  most  of  the  perils  can  be  eliminated  by  careful 
organization  of  the  computations.  In  particular,  most  of  the  difficulties  with 
the  least-squares  method  arise  out  of  poor  conditioning  of  the  matrix  X,  that 
is,  some  of  the  columns  of  X  are  nearly  linearly  dependent  on  other  columns. 
Sometimes  the  columns  are  badly-scaled,  and  this  also  can  cause  trouble.  The 
point  is  that  none  of  these  difficulties  involve  the  vector  y  directly.  The 

matrix  X,  the  set  of  reference  vectors,  is  known  long  before  Lhe  system  goes 
into  operation.  It  seems,  therefore,  that  we  can  study  and  simplify  the  matrix 
X  under  laboratory  conditions.  If  this  is  done  well,  it.  should  be  possible  to 
avoid  most  of  the  difficulties  that  might  afflict  Lhe  computations  done  alter 
the  unknown  vector,  y,  is  received. 

This  implies  that  wc  should  divide  the  computational  algorithm  so  that  as 
much  as  possible  is  done  "in  laboutory",  before  its  actual  use.  The  results 
are  stored  in  the  "final  computer",  and  only  the  final  steps,  the  parts  that 
involve  the  unknown,  y,  are  done  in  t  lie  litt.il  computation.  In  particular,  all 
exploratory  work  having  to  do  with  the  conditioning  of  the  matrix  X  should  be 
done  in  the  laboratory. 


Reproduced  from 


Once  we  have  adopted  I  lie  viewpoint  lli.il  I  lie  eniuputal  ions  should  be  divided 
into  two  stage:.,  a  preliminary  singe  and  a  final  stage,  we  want  to  use  existing 
algorithms  that  .ire  similarly  divided  into  phases.  The  UNPACK  algorithms  are 
so  arranged  ami  are  at  least  as  el  I  u  ient  and  parsimonious  ol  storage  as  any 
available  programs  lor  doing  least -stpinres,  |?].  For  the  present,  then,  we 
decide  to  base  our  algorithm  on  the  UNPACK  subroutines  (in  FORTRAN)  lor  doing 
least-squares.  It  is  possible  that  subroutines  could  be  written,  specifically 
designed  lor  the  present  situation  (lor  example,  a  very  well-conditioned  X), 
that  would  do  better  than  the  UNPACK  routines  in  some  respects.  However,  the 
improvement  is  not  likely  to  be  great,  and  so  we  limit,  ourselves  to  these 
reliable  programs. 

The  UNPACK  collection  furnishes  three  procedures  for  doing  least-squares 
problems.  They  are: 

T 

(a)  Cholesky  Decomposition  of  X  X, 

(b) .  QK  Decomposition  ol  X,  and 

(c)  Singular-Value  Decomposition  of  X. 

All  three  provide  methods  for  t  rausl  <>i  mi  ng  the  matrix  X  into  a  form  that  is 
convenient  to  solve.  This  will  he  done  in  the  laboratory.  The  1 irsl  .wo  also 
give  stihrotil  i lies  that  solve  lor  II  when  y  is  given.  No  such  program  companion 
io  (c)  is  given,  though  it  would  be  easy  to  write  one.  The  singular-value 
decomposition  is  computationally  expensive  (even  for  laboratory  circumstances), 
and  its  advantages  become  evident  only  when  dealing  with  a  poorl y-condit icned 
matrix.  We  shall  use  it  in  the  exploratory  phase  of  our  work,  when  we  are 
studying  '.he  condition  of  a  set  ol  relerenrc  vectors,  but  we  do  not  use  it  as 
part  of  the  scheme  once  a  we» l-co»d't  ioned  set  of  reference  vectors  has  been 
found. 

For  this  situation  we  have  to  choose  between  (a)  Cholesky  Decompostion  and 
(b)  QR-Qecompost ion.  The  UNPACK  manual  equivocates  on  this  question,  so 
parallel  programs  were  written  that  used  bot.h  methods.  There  was  no 
significant  difference  between  t hem  in  time-tests,  but  the  Cholesky 
Decompostion  algorithms  used  about  107.  fewer  words  of  instruction  than  the  QR 
algorithms  for  the  same  amount  of  data.  Accordingly  we  decided  to  use  Cholesky 
Decomposition  as  our  algorithm.  The  machine  version  of  the  final  p-'gram  for 
analyzing  a  (30  x  13)  matrix  required  in  instructions  and  data  9  j 0  36  bit 
words  on  a  UN1VAC  1106,  of  which  6300  words  were.  FORTRAN  overhead. 

The  computer  programs  LAB  and  FINAL  are  listed  in  Appendix  A  and  described 
by  their  documentation.  The  LAB  program  reads  in  t  lie  matrix  X  ol  spectral 
intensities  of  the  reference  set  of  substances.  The  symmetric  matrix 

T 

A  ■  XX 

is  formed  and  factored  into  l  lie  pnxhn  I 

A  *  K 1 R 
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where  R  is  ail  upper  triangular  matrix.  The  LINPACK  routine  SPPFA  finds  the 
matrix  R-  Also  t  lie  matrix  A’"  is  (mind  by  subroutine  SPPDI ,  and  th,o  vector  V 
with 

V.  =  |  A'1  .  ./(NR-NC)  J1/2 

J  :.i 

is  calculated.  The  arrays  X,  R  and  V  are  written  in  file  7  to  be  used  by  the 
FINAL  program. 

The  FINAL  program  tcads  the  contents  of  file  7  ..nd  the  vector  spectrum  of 
of  the  unknown,  y.  The  vector  W  =  X*y  is  calculated  and  the  linear  system 

AB  X^XB  r.  x‘y  =  W 

solved  by  LINPACK  routine  SPPSL.  Then  the  residual  vector 


and  its  sum  ol  squares, 


e  =  y  -  XB 


S2  *  e*e 


e 


2 

i 


are  found.  Finally,  the  t-values  associated  with  E.are  written  out  ‘if  t. . 
The  FINAL  program  has  a  loop  lor  simulating  randomness  efiects.  It  reads  in  a 
basic,  unknown  vector,  II,  and  generates  NR  sample  y-vcctors,  obtained  by  adding 
to  U  a  vector  of  Gaussian  random  i. umbers  with  mean  =  0  and  standard  deviation 
DSD. 


These  programs  are  used  later  in  studying  the  effects  on  detection  of 
altering  various  properties  of  the  reference  set  or  unknown. 

4.  TESTING  THE  PROCEDURE  IN  THE  LABORATORY.  A  trial  set  of  data  was 
furnished,  perhaps  typical  cf  the  spectra  thaL  the  procedure  must  treat.  It 
comprises  the  spectra  of  13'  substances,  each  spectrum  being  defined  by  30  peak 
intensities,  hence  is  represented  by  a  matrix  with  30  rows  and  13  columns.  The 
matrix  is  shown  in  Table  1.  We  now  proceed  to  analyze  it,  that  is,  wc  do  the 
laboratory  part  of  the  statistical  treatment. 

The  first  step  is  to  rearrange  the  data  matrix  so  the  columns  are 
normalized  ;o  have  length  10,000,  and  the  rows  arc  ordered  according  to  their 
median  entries.  That  is,  the  median  ol  each  row  is  found  and  the  rows 
exchanged  so  that  the  iirst  row  is  the  one  with  the  largest  median,  the  second 
row  has  the  next  largest  median,  etc.  Table  2  shows  this  rearrangement  of  the 
array,  which  we  now  designate  the  matrix  X;  this  cable  also  includes  a  trailing 
column  (not  p.<rt  of  X),  giving  for  each  row  its  index  in  the  original  dam- 
matrix.  The  matrix  X  5s  the  object  «ii  the  study  that  follows. 
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Our  prel iminary  invcsL igaL ion  is  concerned  mainly  with  the  condition  of 
the  matrix  X.  Also  we  are  interested  in  the  following  notion.  A  glance  at 
Table  2,  matrix  X,  shows  that  the  lower  rows  have  vei  y  small  entries  compared 
with  the  first  tew  rows.  Since  there  is  some  random  error  in  the  matrix 
elements  in  any  case,  it.  is  doubt  Ini  that  these  rows  are  contributing  much 
information  to  the  idem  i  i  ic.it  ion  ol  the  precise  substances.  Perhaps  little 
discerning-power  would  he  lost  if  I  hose  rows  were  deleted.  However,  we  have  to 
be  careful  about  doing  Ihis  for  reasons  explained  in  the  next  paragraph. 

There  are  two  levels  of  identification  that  concern  us  in  this  project. 
Primarily  we  are  interested  in  the  question  "Does  the  unknown  spectrum  contain 
a  significantly  non-zero  concentr.it  ion  ol  any  of  the  reference*  substances?" 

Only  secondarily  are  we  interested  in  which  specific  substance  is  present  and 
its  concentration.  (Phis  is  somewhat  diiferent  from  the  relative  importance 
usually  accorded  these  two  questions  in  a  laboratory  setting  and  is  one  way  in 
which  this  problem  differs  i rom  chemical  identification  in  the  laboratory.) 

Tie  importance  of  the  rows  with  small  entries  is  slightly  different  with 
respect  to  these  two  questions.  Tlu*  small  entries  may  not  have  much  ellecL  on 
the  determination  of  the  specific  substances  but  may  have  a  telling  impact  in 
deciding  whether  an  unknown  substance  belongs  to  the.set  or  noL.  For  the 
unknown  could  differ  from  a  member  of  the  set  by  having  a  high  peak  where  all 
members  of  the  reference  seL  have  low  intensity.  If  this  peak  is  d>  leted.  we 
could  misLakonly  identify  the  unknown  as  a  member  of  the  set,  that  is,  we  risk 
setting  oil  a  false  alarm.  W<*  inusi  ,  therefore,  be  very  cautious  about  deleting 
rows  of  X. 

There  are  many  ways  of  assessing  the  condition  of  the  matrix  X.  They 
range  from  very  simple  procedures,  sueh  as  finding  the  correlation  matrix  for 
the  columns  of  X  up  to  doing  the  singular  value  decomposition  of  the  matrix. 

We  hope  to  find  and  perhaps  remove  any  near  dependencies  (linear 
relationships),  among  the  columns  of  X.  The  most  complete  information  about 
the  presence  of  these  dependencies  is  furnished  by  th/£  singular-values  of  X, 
the  positive  square-roots  of  the  eigenvalues  of  A  -  X  X.  In  particular,  the 
presence  of  dependencies  is  indicated  by  high  values  ol  the  raLio  ol  the 
largest  to  the  smallest  singular-value. 

A  convenient  procedure  for  investigating  the  dependencies  among  the 
columns  of  X  is  called  V-NULL  and  is  part  of  the  collection  called  ROSEPACK. 

It  causes  a  singular-value  decomposition  to  be  done.  Estimates  of  the 
singular-values  and  their  ratio  to  the  largest  singular-va’.ue  are  printed  oat. 
The  user  is  than  asked  to  name  the  index  of  the  smallest  non-zero  singular- 
value.  This  choice  implies  a  decision  that  Lhe  rank  of  the  matrix  is,  say,p  . 
Then  there  will  exist  N-P  dependent  columns  and  p  independent  columns,  and  the 
program  prints  out  estimates  of  the  coefficients  in  the  relations  between  the 
dependent  and  independent  columns. 

This  program  was  run,  using  the  matrix  X  as  input,  with  results  shown  in 
Table  3-  The  ratio  of  the  largest  to  smallest  sirgular-value  is  1/0.00935  * 
107,  which  is  a  bit  large  for  comfort  though  not  indicative  of  major  trouble. 
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In  particular,  a  stcm-and-leat  display  of  the  logarithm  of  the  ratio  of  the 
largest  singular-value  to  each  ol  l  he  others.  Table  4,  shows  no  striking  gaps. 
Hence  there  is  no  obvious  choice  ol  rank.  A  tentative  selection,  p  =  12,  was 
made,  and  the  program  calculated  that  column  4  is  dependent  and  the  dependency 
relation  is: 

C.  -  -  0.1480.,  .  0.117C-  -  O.OOfcC,  .  0.039C,  r  0.857C,-  0.570C,  +  0.078Cfl  - 
4  2J1jo78 

0.235C9  +  0. 3580 jQ  +  0.126Cn-  0.2510^  +  0.631Cn 

We  conclude  from  this  thaL  the  matrix  X  is  not  extremely  close  to  being 
singular.  There  is  no  strong  relationship  among  the  columns  of  X,  but  the 
strongest  ol  the  somewhat  weak  dependencies  is  between  and  the  columns 
having  large  coefficients,  namely  C^,  C13,  and  C -j.  The  relation  between  C4  and 
C5  is  depicted  as  particularly  close  in  this  treatment,  and  scrutiny  of  X 
confirms  that  these  columns  are  similar. 

At  this  point  we  might  choose  Lo  remove  (or  C(j)  from  X  if  we  thought 
the  relation  between  them  was  close  enough  to  adversely  affect  thr  results. 
Rather  than  doing  LhaL,  however,  we  use  V-NULl.  to  determine  what  is  the  effect 
on  the  condition  of  X  when  we  delete  rows  with  small  entries,  the  lower  rows  of 
X.  The  results  of  deleting  the  last  11  and  14  rows  of  X  are  also  shown  in 
Table  3.  The  singular  values  are  not  changed  much,  which  suggest  that 
omitting,  say,  the  last  10  rows  of  X  would  not  greatly  weaken  our  ability  to 
distinguish  the  columns  of  X.  However ,  we  prefer  to  retain  these  rows  for 
reasons  mentioned  earlier. 

In  this  section,  wo  have  discussed  procedures  for  detecting  poor 
conditioning  ol  the  matrix  X.  The  l real menl  has  not  been  exhaustive  by  any 
means.  For  example,  wc  have  not  used  the  idea  of  the  variance-decompostions 
proportion  lor  the  sake  of  brevity,  |3|.  In  general,  we  shall  do  everything 
necessary  in  a  laboratory  computation  lo  .sure  that  X  is  well-conditioned 
before  it  is  used  in  the  final  computation. 

5.  TESTING  THE  FIELD  PROCEDURE.  In  this  section  we  describe  the  results  of 
tests  of  the  program  for  doing  the  final  computations.  These  tests  are  in  the 
form  of  Monte-Carlo  simulations  in  whiih  the  unknown  vector,  y,  is  proportional 
to  one  of  the  columns  of  X  with  added  noise.  A  few  tests  are  also  done  in 
which  y  is  purely  random  noise;  its  only  resemblance  to  the  reference  set  is  a 
chance  one. 

The  computational  procedure  starts  with  the  matrix  X,  hopefully  well- 
conditioned,  and  applies  the  LAB  program  to  create  the  file  that  will  be  read 
by  the  FINAL  program.  The  FINAL  program  reads  this  file,  then  reads  the 
unknown  vector,  U,  generates  NR  samples  of  y  by  adding  noise,  and  analyzes 
each.  The  random  noise  is  independent  Gaussian  with  standard  deviation  (SD). 
Tests  were  made  with  SD  -  100,  200,  300  and  500.  It  is  thought  that  SD  =  30C 
corresponds  roughly  to  a  noise  level  of  107.  because 
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TABLE  4*  DISTRIBUTION  OF  LOG  CSUR) 

ie 

SUR  •  (LARGEST  SINGULAR  UALUE/EACH  SINGULAR  UALUE) 


STEM-AND-tEAF  DISPLAY  OF  LOG(SUR) 
LEAF  DIGIT  UNIT  •  .1000 

1  2  REPRESENTS  1.2 


1  +0*  0 

1  +0T 

2  +0F  5 

3  +0S  ? 

4  +0*  8 

5  18  0 

(2)  IT  22 

6  IF  45 

4  IS  7 
3  1.  88 

1  28  0 


2(SD)'v(median  ot  largest  values  in  eat h  column)/10,  so  that  these  levels 
bracket  the  noise  we  expect  to  encounter  in  the  field. 


The  Monte-Carlo  simul.it  ions  for  the  iirst  condition  were  done  by  taking 
each  column  ot  X  in  turn  as  the  basic  unknown  vector,  U,  then  generating  NR  = 

20  or  AO  samples  of  y.  In  this  way  260  or  520  trials  were  made  at  each  noise 
level.  The  results  are  stated  in  lenns  of  three  integers, 

NND  =  number  of  trials  in  which  nothing  was  detected 

NED  -  number  of  trials  in  which  the  true  column  was  detected 

NWD  =  number  oi  trials  in  which  a  column  was  wrongly  detected, 

where  detection  of  a  column  means  that  the  t-value  for  the  regression 
coefficient  of  that  column  was  (a)  larger  than  2  and  (b)  larger  ihan  the  t- 
values  for  the  other  columns. 

The  results  are  shown  in  Table  5  for  SD  =  200,  300,  and  500.  Trials  were 
run  for  SD  =•  100,  but  the  results  are  n  t  shown  because  they  were  perfect;  all 
260  trials  resulted  in  exactly  correct  detection.  In  looking  at  these  results 
we  must  bear  in  mind  that  we  would  like  the  system  to  yield  primarily  a  small 
number  for  NND,  because  in  these  trials  the  unknown  is  always  a  member  of  the 
reference  set.  If  NND  is  large  it  means  that  the  system  often  fails  to  detect 
the  presence  of  a  reference  substance.  Table  V  shows  that  the  overall  ratio  of 
NND  to  the  number  of  trials  is: 

for  SSD  «  200,  r  s  NND/260  =  H/2b0  *  0.031 
NL) 

for  SSD  *  100,  r  =  NND/520  -'.H/')20“  0.112 
ND 

for  SSD  =  500,  r  ^  NND/260  »  60/260  »  0.231 
ND 

These  results  suggest  chat  the  system  detects  the  presence  of  some  reference 
substance  (when  one  is  present)  in  about  89%  of  the  trials  at  this  noise  level. 
If  the  instrumentation  could  be  improved  so  that  SSD  =  200  (about  a  6%  noise 
level),  the  estimated  detection  probability  rises  to  97%. 

The  table  also  furnishes  information  about  the  less  crucial  determination 
of  the  exact  substance.  We  see  that,  the  overall  probability  of  this  is 
estimated  to  be: 

for  SSD  =  200,  r„^  =  236/260  ^  0.908 

ED 

for  SSD  =  300,  rpn  =  386/520  =«  0.7A2 
ED 

for  SSD  -  500,  r__  =  128/260  -  O.A92. 

bu 

These  are  less  satisfactory  than  the  previous  resulcs,  but  also  less  important. 


825 


The  table  also  tells  us  quite  .1  lot  about  the  specilic  columns  that  are 
well  or  poorly  detected.  We  see  that  lor  SD  =  300  columns  2,  3,  5,  9,  and  10 
are  very  distinctive.  When  they  are  present,  something  is  always  detected  and 
almost  always  the  correct  column  is  Inund.  Columns  1  and  8  are  less  well- 
defined  but  Siam'  out  fairly  clearly.  Contrariiy  when  column  4  is  present, 
the  system  has  a  lot  ol  trouble  in  detecting  anything,  and  when  something  is 
found,  iL  is  often  identified  incorrectly.  To  a  lesser  extent,  this  is  also 
true  of  columns  6,  7,  11,  12,  and  13.  These  results  agree  fairly  well  with  the 
implications  ol  the  V-NULL  analysis  described  in  the  previous  section.  For  SD 
=  200  and  500  these  distinctions  are  less  clear  than  at  SD  =  300  but  coarsely 
similar. 

The  oflect  of  deleting  t  lie  last  1  on  rows  of  I  lie  matrix  X  was  investigated 
by  a  similar  Monte-Carlo  procedure.  The  results  are  not  shown  in  detail,  but 
the  overall  probabilities  of  no  detection  and  exact  detection  are: 

for  SD  =  200, rND  =  0.077, r£[J  =  0.862 

for  SD  =  300, rN[)  =  0.165,r£D  =  0.677 

for  SD  =  500,  r  =  0.315,r„„  «  0.431 

ND  LU 

There  is  a  statistically  significant  loss  of  discrimination  (at  the  95%  level) 
in  these  results,  as  compared  with  those  using  the  full  matrix. 

The  V-NULL  analysis  suggests  that  the  most  nearly  dependent  column  of  the 
matrix  X  is  column  4.  It  is  natural,  then,  to  investigate  the  effect  on  the 
detection-process  of  removing  column  4  l rom  the  maLrix.  Table  6  shows  the 
results  for  this  simulation  carried  out  lor  SL'  -  300  and  compares  them  with  the 
results  for  the  complete  matrix.  The  estimated  detection  probabilities  are: 

Complete  X  X  without  col.  4 


rED 

0.742 

0.831 

rND 

0.112 

0-071 

rWD  . 

.  0.146 

0.098 

We  see  that  deleting  column  4  has  a  generally  favorable  effect  on  detection 
probability,  although  part  of  the  improvement  is  due  to  the  fact  that  we  did 
not  use  column  4  as  an  unknown  in  these  simulations.  Examination  ot  Table  6 
shows  that  most  of  the  improvement  was  caused  by  the  elimination  of  column  4 
from  consideration  and  better  detection  of  columns  6,  7,  and  13.  This  agrees 
well  with  the  results  cf  the  V-NULL  analysis  described  in  the  previous  section. 
However,  we  notice  also  that  column  11  is  detected  less  well  than  before.  The 
reason  for  this  is  not  clear.  In  a  real  life  situation  this  would  lead  us  to 
do  further  analysis  of  the  null-space  of  X,  but  we  shall  not  pursue  it  here. 
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In  all  the  cases  described  so  far,  Lhe  unknown  vector,  y,  was  assumed  to 
consist  of  one  column  of  X  plus  noise.  A  few  simulations  were  also  run  in 
which  y  was  a  linear  combination  of  two  columns  of  X  with  added  noise.  The 
results  were  much  worse  then  witli  one  column.  The  set  of  simulations  was  far 
from  complete,  but  the  probability  ot  detecting  nothing  when  two  columns  are 
present  appears  to  be  about  0.25,  which  ia  quiLe  poor. 

It  still  remains  to  investigate  the  propensity  of  this  system  Jor  giving 
false  alarms.  For  this  purpose  it  would  be  desirable  to  have  a  s<. c  of  mass 
spectra  typical  of  substances  that  might  be  present  at  the  same  time  as  those 
of  the  reference  set.  Members  of  this  set  would  then  be  used  as  the  y-vectors, 
and  we  could  see  whether  Lhe  system  mi s i dent i f ies  any  of  them.  Unfortunately 
such  a  set  is  not  available.  A  much  less  desirable  procedure  is  to  generate 
random  vectors  from  some  plausible  distribution  and  see  what  fraction  of  them 
is  identified  as  one  of  the  reference  set.  Since  we  are  using  a  critical  t- 
value  of  2,  wc  expect  that  about  57,  of  the  cases  will  be  identified  as  members 
of  the  reference  set  if  the  underlying  distribution  is  close  to  Gaussian.  A 
few  preliminary  trials  of  this  nature  were  carried  out,  with  the  expected 
results,  but  it  is  difficult  to  see  what  could  be  learned  from  a  large 
simulation  since  relatively  little  is  known  about  the  distribution  of  the  mass 
spectra  in  natural  environments. 

6.  CONCLUSIONS  AND  DISCUSSION.  The  principal  conclusion  that  can  be  dr.iwn 
from  this  effort  is  that  the  proposed  scheme  appears  workable,  provided  the 
real  life  situation  is  reasonably  close  to  what  is  assumed  in  the  simulations. 
It  is  likely  that  with  care  in  the  conditioning  of  the  reference  matrix,  we  can 
achieve  successful  detection  in  more  than  907,  of  the  trials.  The  probability 
of  false  alarms  is  less  well  determined,  especially  under  conditions  that  might 
arise  in  military  operations. 

The  following  comments  ar.d  suggestions  appear  relevant: 

(a)  It  would  be  very  desirable  to  obtain  spectra  of  sampN  volatiles 
occurring  under  natural  condit  ions.  We  must  have  this  or  equivalent 
information  in  order  to  see  whether  i he  system  will  produce  false  alarms  under 
operating  condit  tons. 

(b)  Further  thought  should  be  given  to  the  problem  of  improving  the 
condition  X.  The  procedure  exemplified  in  section  4,  simply  deleting  the  most 
dependent  column  is  widely  practiced  but  sometimes  unreliable.  On  the  other 
hand,  ^he  procedures  usually  suggested  as  replacements  (see  Belsley,  Kuh,  and 
Welsch  ),  ridge  regression  and  mixed  or  Bayes ia;-  estimation,  involve  appending 
questionable  information,  beyond  that  available  from  the  spectra,  and  are, 
therefore,  also  open  to  criticism. 


(c)  The  iinal  program  used  here  is  quite  small  and  is  probably  within  the 
capacity  of  most  minicomputers.  However,  it  is  in  FORTRAN  and  perhaps  will 
need  reprogramming  into  BASIC  or  some  assembly  language  for  actual  use. 


rfwtTrnsvw  -■'j*5 
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(d)  We  have  not  conducted  any  formal  search  for  outlying  or  highly 
influential  rows  in  the  data  matrix.  We  have  omitted  this  aspect  of  the  study 
because  it  is  reasonably  clear  from  the  results  already  found  that  for  our 
purposes  none  of  the  rows  are  excessively  influential.  For  if,  say,  one  row 
was  very  influential,  the  B  values  would  fluctuate  wildly  from  one  trial  to 
the  next,  according  to  the  randomly  changing  y-entry  in  that  row.  Our 
principal  concern  is  detecting  the  j  for  which  Bj  is  largest  and  deciding 
whether  that,  tj  >2;  this  primary  conclusion  is  quite  reproducible  from  one 
trial  to  the  next.  It  is  possible  that  other  estimates  are  influenced  mostly 
by  one  row,  but  there  is  little  evidence 4  that  the  primary  conclusion  is  so 
affected.  Nevertheless,  it  might  be  interesting  to  examine  the  hat  matrix 


H 


T  - 1  T 
X  (XX)  X 


for  this  problem.  Because  of  the  initial  ordering  ef  the  rows  of  X.  it  seems 
plausible  that  each  row  would  be  more  influential  than  its  successor,  but  this 
has  to  be  checked. 


(e'  In  conclusion,  it  is  a  good  idea  to  re-emphasize  the  factors  that 
make  this  problem  different  from  a  routine  application  of  least-squares 
regression.  They  are 

(1)  limited  computing  power  and  data  accuracy, 

(2)  a  modest  number  of  reference  vectors,  and 

(3)  a  very  specific  objective,  namely,  to  see  whether  the  unknown  vector 
contains  a  statistically  significant  amount  of  any  of  the  reference  vectors. 
Although  we  would  like  to  detect  which  vectors  are  present,  that  is  much  less 
important  than  detecting  whether  any  ere  found. 
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APPENDIX  A 


PROGRAM  LAB 


CSX  THIS  II  TnC  !N-La»ORATO«Y  Oil  AAF.LIMXNARV 
CSSS  AART  Or  THE  ALGORITHM  FOR  CHEMICAL-DETECTION 
CSSS  IV  USE  Of  CHOLESKY-DECOMAOimON.  ?*  SACCTAAt- 
CSSS  INTENSITY  NATRIX.  X,  IS  HEAD  tN  AND  IVMMCTAIC 
CSSS  MATRIX  A  GENERATED.  THII  ZS  THEM  FACTORED  AND 
CSSS  THE  INVERSE  FOUND.  X.A  AND  THE  DIAGONAL  Of  THE 
CSSS  INVERSE  ARC  URITTIn  IN  FILE  ?.  wHCHCC  THEY 
CSSS  ARC  READ  IV  THE  FIELD  AAOCRAA  Cf . 

RIAL  X(3S,13).AtlH>.UU3) 

ISA  FORMAT ( ) 
lil  FORNAT( 13FS.S) 

2SS  F0RNAT<SE1S.7) 

291  FORNAT(SIS) 

CSSS  READ  NR.NC  AND  THE  MATRIX  X.  THEN  WRITE  X 
CSSS  IN  FILE  7. 

READ <S. ISA >  NR.NC 
U«IVE(7,2S1)  NR.NC 
DO  4  1*1. NR 

READ(S.ISI)  (X(  It J ),J-1,NC ) 

4  URITE(7.2AA)  (XCl.JJ.J-l.NO 
CSSS  CALCULATE  THE  UPAER-TRIANCLE  OF  THE  SYMMETRIC 
CSSS  MATRIX  A  •  I X-TRANSAOSE )SX  AND  STORE 
CSSS  IN  VECTOR  REPRESENTATION. 

CO  2S  J-l.NC 
DO  IS  1-1, J 
K-ICM 

H(K)*$DOT(NR,X(  1, 1  ),1,X(1,J),I3 

IS  continue 
as  continue 

csss  USE  LINAACIC  SUtROUTDC  SANTA  TO  DO  CMOtESKv- 
Cstg  FACTORIZATION  OF  SYMMETRIC  MATRIX  A.  WRITE  THE 
CSSS  RESULTING  MATRIX  A  IN  riLC  7. 

CAU  SfAfAlA.MC.INFO) 

IFC IhfO.CT.S )  URITEtt.lSI ) ,INFO 
WRITE(7,aH>  (AOCK), «•!.*> 

CSSS  CALCULATE  THE  DIAGONAL  TERMS  OF  THE  INVERSE  Of 
csss  SYMMETRIC  MATRIX  A  AMD  DIVIDE  IV  T*  DECREES 
Csss  OF  FREEDOM  TO  OCT  THE  VECTOR  V.  THEN  WRITE 
Cits  SQRT(V)  IN  FILE  7.  UHAACK  SUBROUTINE  SAADI  It 
CSSS  USED  TO  FINS  THE  INVERSE  OF  A. 

CALI  SAADI  (A, NO, M> 

C'N*~NC 

JJ-S 

DO  1  J-I.HC 

1  V(J)*SCRT(A(JJ)/C/ 
aRITE<7,SS9)  (V(J).J-l.NC? 

END 


V  '/  - 
■  V  V  \-  .*•  V 


V  V-  V  V  J- 


,Vv 


PROGRAM  FINAL 


1 

2 

3 

4 

5 

6 

7 

8 
9 

19 

11 

12 

13 

14 

15 
IS 

17 

18 
19 
28 
31 
22 

23 

24 

25 

26 

27 

28 

38 

31 


C  THIS  XS  THE  FXNM,  PART  OF  THE  PROCEDURE  FOR 
C  IDENTIFYING  UNKNOWN  SPECTRA.  IT  READS  IN  THE  UNKNOWN 
C  SPECTRUM  AND  DOES  A  LINEAR  REGRESSION  UIA  CHOLESKV 
C  DECOMPOSITION  TO  ESTIMATE  THE  REGRESSION  COEFFICIENTS  AND 
C  THEIR  T-UAUJES.  LARGE  T-UALUES  CORRESPOND  TO  THE 
C  MOST  PROBABLE  KNOWN  SUBSTANCES  IN  THE  LIST  CF  THOSE 
C  PREVIOUSLY  ACQUIRED  FROM  THE  ‘LAB*  PROGRAM,  CM  .  AS  MUCH 
C  AS  POSSIBLE  OF  THE  UORK  HAS  BEEN  DONE  IN  THE  LAB 
C  PROGRAM. 

C 

PARAMETER  LDR-30,  LDC-13,  LDP-185 
REAL  X(LDR,LDC  ),U(LDC ),A(LDP),U(LDR) 

REAL  B(LDC ),V(LDR > 

INTEGER  XR(LDC) 

DOUBLE  PRECISION  DSD 

Cttt*  READ  IN  DATA  CREATED  BV  THE  "LAB"  PROGRAM,  CH, 

READ  (7,98 >  NR,KC 
DO  10  I • t , NR 

18  READ  (7, 128)  <X(X,J),J-i,NC) 

K-NC*(NC*l>/2 

READ  (7,120)  ( A(KK).KK-i,K ) 

READ  (7,120)  (V(J),J-1,NC) 

Cm*  READ  IN  DATA  FOR  UNKNOWN  SPECTRAL  INTENSITIES. 

DO  20  I  *>  1  c  NR 
20  READ  (5,120)  U(I) 

READ  (5,110)  NREP,SD,DSD 

C**»*  START  OF  LOOP  FOR  SIMULATING  RANDOMNESS  EFFECTS 
DO  80  L-l ,NREP 

Cm*  INTRODUCE  RANDOM  GAUSSIAN  NOISE  . 

DO  30  1*1, NR 

Y(I)-U<X)+SD*GGNQF(D$D) 


PROGRAM  FINAt,  CONTINUED 


39 

c  am 


cm* 
c  m* 


60 

cm* 


IF  ( Y  ( I ) , LT . 0 . 0 )  V ( I  )a0<0 
FIND  REGRESSION  COEFFICIENT  ESTIMATES,  B(J)  . 

DO  40  J«i,NC 

B<J)-SDOT(NR,X<l,J),l,Y,l) 

CALL  SPPSL  <A,NC,8) 

FIND  RESIDUALS  AND  T-UALUES  OF  COEFF.  ESTIMATES. 

*RRJ)(  IS  USED  F0R  80TH  ^SIDUALS  AND  T-UALUES 
DO  5*  1*1, NR 

V<n-$DOT<NC,xa,i),lDR,B,l)-V<n 

S*SNRMS(NR, Y, i  ) 

DO  60  J*1,NC 
IR( J )«J 
SE*U< J)*S 
Y(J)-B(J)/SE 

SORT  ON  T-UALUES  AND  URITE  OUT  FOR  T  >  1.0  . 

CALL  USRTR  X\',NC,IR) 

WRITE  (6,90)  L 
DO  70  J«NC,1,-1 

K*IR(J) 

IF  (Y(J).LT.  1.0)  GO  TO  80 
WRITE  i 6,100)  K,B(K),Y( J) 

CONTINUE 


FORMAT  <815 ) 
FORMAT  <I3,2F7.4) 
FORMAT  <) 

FORMAT  (5E15.7) 


^  ."■j 
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MATHEMATICAL  FOUNDATIONS  FOR  ROBOTICS 


John  E.  Hopcroft 
Department  of  Computer  Science 
Cornell  University 
Ithaca,  N.Y.  14833 

ABSTRACT.  A  major  component  of  robotics  is  concerned  with  the  represen¬ 
tation  and  manipulation  of  physical  objects.  The  design  of  computer  algo¬ 
rithms  to  interact  with  these  representations  and  to  carry  out  task  planning 
raises  many  mathematical  and  computational  issues  that  are  not  well  under¬ 
stood.  This  talk  will  explore  some  of  these  issues  and  outline  areas  where 
mathematical  development  is  needed  to  support  a  robotics  effort. 

X.  INTRODUCTION .  Task  level  planning  is  an  important  aspect  of  robot¬ 
ics.  What  is  needed  is  the  ability  to  communicate  a  task  to  a  robot  at  the 
level  of  "insert  peg  in  hole"  and  have  the  detailed  instructions  such  as  opes 
gripper,  move  to  location  x,  rotate  gripper,  etc.  be  automatically  generated 
by  a  computer.  To  do  this  requires  solving  many  problems.  First,  one  must  be 
able  to  automatically  deduce  a  grip  position  that  will  not  interfere  with  the 
desired  task.  Then  one  must  develop  an  approach  strategy  that  will  insure 
gripping  even  in  the  presense  of  some  uncertainty  of  position.  Finally  one 
must  be  able  to  plan  motion.' 

Each  of  these  tasks  requires  the  ability  to  represent  physical  objects  in 
a  computer  in  such  a  manner  that  they  can  easily  be  reasoned  about.  Although 
researchers  in  computer  graphics  have  developed  computer  models  cf  surface 
representations  and  efficient  ray  tracing  algorithms,  these  techniques  are  not 
suited  to  representing  the  physical  objects  themselves  for  several  reasons.  A 
surface  is  easil.  :  -resented  by  bicubic  pat-hes.  However,  a  bicubic  patch  is 
an  18  degree  surface  ani  thus  the  intersection  is  a  degree  36  curre.  This 


precludes  symbolic  or  algebraic  calculations  of  intersections.  Furthermore*  a 
circle  does  not  have  a  polynomial  parameterization  and  hence  a  cylinder  cannot 
be  represented  exactly  by  bicubic  patches.  This  prevents  reasoning  about  a 
cylinder  rotating  in  a  circular  bearing.  One  alternative  to  the  bicubic,  patch 

is  the  Steiner  surface  with  the  canonical  equation 

22,22,22  « 

xy  +xz  +yz  -2xyzv  =  0 

This  surface  is  of  fourth  degree  so  that  algebraic  techniques  are  still  possi- 
ble.  Furthermore,  all  quadric  surfaces  are  special  cases  of  it. 

Another  difficulty  with  simply  geometric  and  topological  models  of  solids 
is  that  solid  objects  have  an  internal  structure  that  is  important.  If  one  is 
designing  an  object  by  editing  an  already  existing  solid  model  of  an  object* 
it  is  important  that  changes  to  the  object  are  reasonable.  Thus  if  an  object 
has  two  holes  aligned  on  an  axis*  moving  one  of  the  holes  should  move  the 
other.  To  design  reasonable  editors  requires  that  a  theory  be  developed  to 
provide  some  insight  into  this  structure. 

Finally,  to  reason  about  objects  and  transformations  of  objects  requires 
some  mathematical  framework.  We  wil?  Mustrate  the  type  of  theory  that  is 
needed  by  one  example.  It  involves  mo  A  objects  in  contact.  In  order  to 
state  the  theorem  we  first  provide  a  simple  definition  of  an  object  and  of  a 
motion. 

For  our  purposes  an  object  is  a  parameterized  homeomorphic  supping  from  a 
canonical  form  of  the  object  to  a  region  of  three  space.  If  parasKters 
specify  the  position  and  orientation  of  the  object*  then  to  instantiate  a  copy 
of  the  object  one  supplies  the  x*y*z  coordinates  and  the  orientation.  How¬ 
ever*  one  can  instantiate  more  than  just  the  location  and  orientation.  There 
could*  for  example,  be  parameters  for  length*  width  and  height  allowing  one  to 
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instantiate  size.  More  generally  even  the  shape  can  be  parameterized  allowing 
one  to  instantiate  shape. 

A  motion  is  simply  a  path  in  parameter  space.  In  this  setting  a  motion 
may  be  a  translation,  rotation,  growth  cr  continuous  deformation.  This  gen** 
erai  approach  allows  us  to  deal  with  non  rigid  objects*  objects  that  can 
change  shape. 

The  type  of  theorem  that  can  be  proved  is  the  following: 

Theorem:  Given  an  initial  and  a  final  configuration*  I  and  F*  of  two  objects 

\ 

in  which  the  objects  are  in  contact*  if  there  is  any  motion  from  configuration 
I  to  F,  then  there  is  a  motion  in  which  the  objects  stay  in  contact  at  all 
time. 

This  work  is  part  of  a  program  to  develop  a  theory  of  solids*  how  to 
represent*  manipulate  and  reason  about  them.  This  theory*  sterophenomenology* 
is  essential  to  the  development  of  advanced  computer  automated  design  and 
manufacture . 
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DYNAMIC  RESPONSE  IN  AN  ELASTIC-PLASTIC  PROJECTILE  DUE  TO  NORMAL  IMPACT 

?.  C.  T.  Chen,  J.  E.  Flaherty,  and  J.  P.  Vasllakls 
U.S.  Army  Armament,  Munitions,  and  Chemical  Command 
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ABSTRACT.  A  numerical  study  of  the  dynamic  response  of  an  elastic- 
plastic  projectile  due  to  normal  impact  has  been  made  using  the  finite  element 
structural  response  code  ADINA.  The  projectile  is  a  finite  length  circular 
cylindrical  bar  striking  a  rigid  target.  First,  three  (central-difference, 
Newmark,  Wilson)  direct  integration  schemes  were  used  for  uniaxial  stress  wave 
problems  in  a  linear-hardening  material  and  the  results  were  compared  with  an 
exact  analytical  solution  in  order  to  evaluate  accuracy  and  stability.  Then, 
additional  numerical  results  for  perfectly-plastic  materials  were  obtained  in 
order  to  show  the  effect  of  strain-hardening.  Finally,  some  results  for  a 
multi-linear  material  model  based  on  two-dimensional  elements  are  presented  in 
order  to  show  the  lateral  effect. 

I .  INTRODUCTION.  The  propagation  of  elastic-plastic  waves  in  long  rods 
has  been  treated  extensively  in  the  literature  [1-3]  since  the  pioneering 
works  of  Donell,  Karman,  Taylor,  and  Rskhmatulin.  The  study  of  plastic  wave 
propagation  is  Important  because  it  attempts  to  explain  the  response  of 
materials  to  intense  dynamic  loading  and  serves  also  as  a  basis  for 
determining  dynamic  material  properties. 

Analytical  solutions  can  be  obtained  for  only  a  few  idealized  situations; 
hence,  many  impact  studies  have  been  performed  using  numerical  methods.  Many 
computer  codes  using  either  finite-element  or  finite-difference  approaches 
have  been  developed.  The  computer  simulation  of  impact  phenomena  in  solids  is 
still  quite  involved  and  it  depends  critically  on  the  impact  velocity.  For 
high  velocity  impact  and  penetration  problems,  a  good  review  was  given  by 
Zukas  [3].  For  low  velocity  contact-impact  problems,  many  structural  response 
codes  were  reviewed  by  Noor  [4]. 

In  the  present  paper,  a  numerical  study  of  the  dynamic  response  of  an 
elastic-plastic  projectile  due  to  normal  impact  is  made  using  the  finite 
element  structural  response  code  ADINA  [5].  The  projectile  is  a  finite  length 
cylindrical  bar  made  of  a  high  strength  steel.  The  bar  is  long  and  travels 
with  velocity  V  -  75  m/s  before  it  strikes  a  rigid  target.  First,  three 
direct  integration  schemes  are  used  for  uniaxial  stress  wave  problem  in  a 
linear-hardening  material  and  the  results  are  compared  with  an  exact 
analytical  solution  in  order  to  evaluate  the  accuracy  and  stability.  Then, 
additional  numerical  results  for  perfectly-plastic  materials  are  obtained  in 
order  to  show  the  effect  of  strain-hardening  for  a  multi-linear  material 
model.  Finally,  some  results  based  on  two-dimensional  elements  are  presented 
in  order  to  show  the  lateral  effect. 


II*  ANALYTICAL-  SOLUTION.  The  problem  of  the  normal  impact  of  a  rod 
against  a  rigid  target  has  been  considered  by  many  authors.  Various  schemes 
were  used  for  different  kinds  of  initial  conditions  and  various  material 
properties.  For.  a  linear  work-hardening  material  due  to  sudden  impact,  an 
analytical  solution  for  the  uniaxial  stress  wave  problem  is  available  [1]  and 
it  is  presented  here  for  comparison  with  the  corresponding  ADINA  results. 

Thus,  consider  a  bar  of  length  L  and  diameter  D  that  is  moving  with  velocity  V 
in  the  negative  Z  direction.  At  time  0  the  bar  strikes  a  rigid  wall  Z  *  0 
(Figure  la).  Guided  by  the  stress-strain  curve  for  a  high  strength  steel 
supplied  to  us  [6] ,  the  following  material  data  will  be  used 

E  *  208  GPa,  P  -  0.783  g/cc,  v  -  0.293 

(1) 

Oy  *  1.3  GPa,  Ep  ■  4  GPa, 

where  E,  p,  v,  oy,  and  Ep  are  Young’s  modulus,  density,  Poisson’s  ratio,  yield 
stress,  and  plastic  modulus,  respectively.  The  elastic  and  plastic  one- 
dimensional  wave  speeds  will  then  be  ce  ■  Vg/ p  ■  5154  m/s  and  Cp  *  /Ep/p  m 
715  m/s,  respectively.  The  velocity  Vy  corresponding  to  the  yield  stress  °y 
of  the  material  is 

Vy  **  Oy/(pcg)  *  Cg£y  “  32.21  m/s  (2) 

Both  elastic  and  plastic  waves  will  be  generated  if  the  Impact  velocity  V  > 

Vy.  Let  us  consider  the  case  V  *  75  m/s,  L  -  1.1  m,  D  -  0.1  m.  After  Impact, 
two  shock  wave  fronts  delimit  three  distinct  regions  In  the  bar  (Figures  lb 
and  lc).  The  analytical  solutions  for  the  particle  velocity,  strain,  and 
stress  in  these  three  regions  are 

VQ  *  -V  =  -75  m/s,  e0  »  0,  0o  «  0, 

V *  —V  +  Vy  *  -42.79  m/s,  ■  -ty  “  -0.625%,  aQ  »  -Oy  *  -1.3  GPa, 

V2  ”0,  e,  -  -ey  +  (V y— V ) / Cp  -  -6.610% 

°2  *  "°y  +  Ep(ez+ey)  "  -1.539  GPa  .  (3) 

After  time  t  *  L/ce  =*  213  us,  the  elastic  wave  front  is  reflected  from  the 
free  end.  Behind  this  frout  (Figure  Id),  03  -  63  «  0,  V3  »  2Vy  -  V.  At  time 
tg  *  2L/(ce+Cp)  *  374.85  us,  the  wave  fronts  of  the  plastic  and  of  the  elastic 
unloading  waves  meet  at  the  section  S.  The  stress  and  velocity  are  continuous 
but  the  strain  is  discontinuous  across  this  section  S,  a  no.'propagabie 
discontinuity  surface.  Since  the  inequality 

2ce 

2Vy  <  V  <  (1  +  - )  Vy  «  88.79  m/s  (4) 

Cg+Cp 

is  satisfied,  the  plastic  wave  stops  at  S  and  elastic  waves  again  propagate 
from  S  in  both  directions  (Figure  le).  The  analytical  solutions  in  regions  4 
and  5  are 
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1  ce 

V4  =  V5  =  -  (3  - )(Vy-V)  +  V  -  L3.79  m/s 

2  CP 

1 

e4  =■  ”  <l+cp/ce)(ey-V/ce)  »  -0.473Z 
c 

e5  =*  -  (2  —  +  1  -  ~)(£y-V/ce)  -  -6.342Z 
cp  ce 

°4  “  °5  *  “0.983  GPa  (5) 

Figures  lb  through  le  show  the  locations  of  the  wave  fronts  at  time  t  * 
100,  200,  300,  400  ys,  respectively.  The  analysis  can  be  continued  until  the 
contact  between  the  bar  and  the  target  ceases  at  tc  -  4L/(ce+Cp)  ■  749.7  us. 
More  detailed  information  about  the  analytical  solution  can  be  found  in  the 
book  by  Cristeseu  [11. 

III.  ADINA  SOLUTION.  The  ADINA  code,  developed  by  K.  J.  Bathe,  is  a 
general  purpose  finite  element  program  for  Automatic  Dynamic  Incremental 
Nonlinear  Analysis  [5] *  In  nonlinear  analysis  the  Incremental  finite  element 
equations  of  motion  used  are,  in  implicit  time  integration, 

M  t+Atjj  +  c  t+Aty  +  tO-  t+AtR  -  fcF  (6) 

and  in  explicit  time  integration, 

•• 

m  *£  +  £  c£  +  %  £  “  -  **  (7) 

where  M,  £,  tK,  £R,  t+AtR,  are  constant  mass  matrix,  constant  damping 
matrix,  tangent  stiffness  matrix  at  time  t,  external  load  vector  applied  at 
time  t,  t+At,  nodal  point  force  vector  equivalent  to  the  element  stresses  at 
time  t,  respectively,  and  U  is  the  vector  of  nodal  point  displacement 
Increments  from  time  t  to  time  t+At,  i.e.,  U  *  t+Aty  -  ty.  The  solution  of 
Eq.  (6)  yields,  in  general,  an  approximate  displacement  increment  U.  To 
improve  the  solutior  accuracy  and  in  some  cases  to  prevent  the  development  of 
numerical  instabilities,  it  may  be  necessary  to  use  equilibrium  iteration  in 
each  or  preselected  time  steps. 

T.n  ADINA,  the  central  difference  method  is  employed  for  explicit  time 
integration  and  either  the  Newmark  method  or  Wilson  method  are  employed  for 
Implicit  time  integration.  The  integratin'-  schemes  [7]  are  given  by: 

1 

ty  - - -  (t+Aty  -  2tu  +  t“Aty) 

(At)2 

•  1 

ty  -  -  (t+Aty  -  t-Aty) 

2  At 


(9) 

(9) 


for  the  central  difference  method 
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c+AtU  *  fcU  +  cUAt  +  [(“  “  a)^  +  at+AtU](  At)2 


(11) 


for  the  Newmark  method,  and 


t+Aty  »  tu  +  ( - )(t+0Atu  _  t0) 

0At 


(12) 


0  >  1,  0  <  T  <  0At 


for  the  Wilson  method. 


The  Wilson  and  Newmark  methods  are  unconditionally  stable  if  0  >  1.37  or 
a  >  1/4(1/24*5) ,  6  >  1/2.  In  our  numerical  study,  we  have  chosen  0  ■  1.4,  a  * 
1/4,  6  «  1/2.  In  using  the  central  difference  method,  the  time  step.  At,  has 
to  satisfy  the  Courant  condition 


At  »  At 


cr 


KAfc/c  or  - 

U) 


(13) 


where  Ai  is  the  minimum  mesh  size,  w  is  the  maximum  natural  frequency,  c  is 
the  local  sound  speed  and  K  <  1. 


IV.  NUMERICAL  COMPARISON.  Consider  a  bar  with  the  following  geometrical 
and  material  data?  L  -  1.1  m,  D  -  0.1  m,  E  «  208  GPa,  p  *  0.783  g/cc,  0y  * 
1.3  GPa,  Ep  -  4  GPa,  subjected  to  two  values  *>f  impact  velocity:  V  -  25  m/s 
or  75  m/s.  Since  the  velocity  corresponding  to  the  yield  stress  oy  of  the 


material  is  Vv  -  32.2  m/s,  the  impact  is  elastic  for  the  first  case  and 


elastic-plastic  for  the  second  case.  Analytical  solutions  are  known  for  both 
cases.  We  use  100  one-dimensional  truss  elements  to  simulate  this  uniaxial 
stress  wave  problem,  in  order  to  satisfy  the,  stability  criterion  for  explicit 
integration  by  the  central  difference  method,  we  have  chosen  the  time  step 
At  *•  2ua  which  is  less  than  the  critical  time  step 


At 


cr 


AZ/c, 


2. 
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We  use  three  Integration  schemes  with  v...«  s r-.~  time  step.  The  computations 
are  all  stable  and  the  numerical  results  for  the  axial  stress  and  velocity 
when  V  *  25  m/s  are  shown  in  Figures  2  through  5.  Figure  2  shows  results  for 
the  particle  velocity  and  stress  along  the  rod  at  t  *  100,  200,  300,  400,  500 
Ps  when  the  Wilson  method  was  used.  Figure  3  shows  the  similar  results  for 
the  Newmark  •'wsthod.  The  numerical  results  based  on  these  two  methods  are  less 
accurate  when  compared  with  the  results  based  on  the  central  difference 
method.  Figure  4  shows  a  comparison  of  the  results  for  the  particle  velocity 
along  the  length  of  the  bar  based  on  the  central  difference  method,  the 
Newmark  method,  and  the  analytical  solution  at  t  *  100,  20C,  300,  400  ps.  A 
similar  comparison  between  the  central  difference  method,  and  the  analytic 
solution  for  the  axial  stress  along  the  bar  is  shown  in  Figure  5.  It  should 
be  noted  that  tfye  computations  are  performed  under  the  assumption  that  the  rod 
and  target  remain  in  contact  after  impact  while  the  theoretical  interval  of 
contact  is  tc  -  2L/ce  *  426  Ps. 
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Calculations  were  also  performed  for  elastic-plastic  impact  with  V  *  75 
m/s  using  the  three  integration  schemes  with  the  same  mesh  size  and  time  step. 
A  comparison  of  numerical  results  for  the  axial  stress  and  velocity  using  the 
central  difference  and  Newmark  methods  is  shown  in  Figure  6  at  t  ■  200  ps. 

The  solid  and  dotted  curves  represent  the  results  based  on  the  central 
difference  method  and  Newmark  method,  respectively.  Similar  results  are  shown 
in  Figure  7  at  t  »  400  ps.  As  can  be  seen  from  these  two  figures,  the  central 
difference  method  gives  more  accurate  results  than  the  Newmark  method.  The 
numerical  results  based  on  the  Wilson  method  are  compared  with  those  based  on 
the  central  difference  method  in  Figure  3.  This  also  demonstrates  that  the 
numerical  results  based  on  the  central  difference  method  is  more  accurate.  We 
may  then  conclude  from  this  study  on  elastic  as  well  as  elastic-plastic  impact 
that  the  central  difference  method  will  give  more  accurate  results  than  the 
other  too  integration  schemes. 

V.  HARDENING  AND  LATERAL  EFFECTS.  After  reaching  the  above  conclusion, 
we  use  the  central  difference  method  for  the  rest  of  this  study.  In  order  to 
show  hardening  effects,  we  obtained  numerical  results  for  displacement, 
velocity,  strain,  and  stress  in  a  long  rod  of  an  elastic-perfectly-plastic 
material.  Figures  9  and  10  show  the  results  of  the  particle  velocity  and 
stress  for  a  linear  work-hardening  as  well  as  a  perfectly-plastic  material  at 
t  ■  300  and  400  ps,  respectively.  It  can  be  seen  from  these  comparsions  that 
the  effect  of  strain-hardening  on  the  particle  velocity  and  stress  is  quite 
significant  even  though  the  plastic  modulus  Ep  *  4  GPa  is  small  when  compared 
with  the.  elaelJC  modulus  E  ■  208  GPa. 

In  order  to  study  lateral  effects,  we  have  used  two-dimensional  four-node 
quadrilateral  ring  elements  to  obtain  numerical  results.  We  choose  the  same 
mesh  sice  Ar  *  &z  *  0.011  m  and  use  50  elements  along  the  length  of  the  bar 
with  L/D  *  25.  In  this  arrangement,  the  new  length  of  the  bar  is  only  a  half 
of  the  original.  We  have  used  the  same  time  step  At  *  2  ps  as  in  the  ore- 
dimensional  truss  elements.  This  time  step  yields  stabl;  computations  for 
one-dimensional  truss  elements  but  not  for  two-dimensional  quadrilateral  ring 
elements.  This  seems  due  to  the  lateral  effect  such  as  the  Poisson’s  ratio. 
Including  the  effect  of  Poisson's  ratio  (v  =»  0.293),  the  speed  of  longitudinal 
elastic  wave  is  cj  *  [(E/p)((l-v)/(l+v))/(l-2 v)]*/*  .  5923  m/s,  which  reduces 
to  ce  *  (E/p) 1/2  »  5154  m/8  in  ca8e  0f  v  *  0.  Guided  by  the  stability 
criterion  for  linear  elastic  problems,  we  shall  choose  At  <  Atcr  *  AZ/c^j  ■ 
0.011/5923  sec  ■  1.857  Ps.  For  this  reason,  we  have  carried  the  computations 
for  250  time  steps  using  At  *  1  ps.  The  total  number  of  time  steps  is  the 
same  for  both  the  one  and  two-dimensional  problems.  The  length  of  the  bar  and 
time  increment  for  the  two-dimensional  case  are  only  half  of  the  one- 
dimensional  case.  For  the  two-dimensional  case,  we  have  carried  out  the 
computations  for  impact  velocities  of  V  ■  25  m/s  and  75  m/s.  The  results  for 
>.;.e  elastic  Impact  (V  *•  25  m/s)  are  not  shown  here.  For  elastic-plastic 
impact  (V  *r  75  m/s),  we  have  carried  out  the  computations  for  a  bilinear  as 
wel’  as  a  multi-linear  material  model.  Seven  points  are  used  to  represent  the 
stress-strain  curve,  i.e.,  ( o  in  GPa,  e  in  i)  *  (1.3,  0.63),  (1.355,  1.03), 
(1.38,  1.83),  (1.394,  2.63),  (1.415,  4.23),  (1.43,  5.83),  (1.45,  8.63).  The 
numerical  results  for  the  case  of  a  multi-linear  material  ate  shown  in  Figures 
11  through  14.  ‘Figure  11  shows  the  effective  stress  and  axial  velocity  along 
the  length  of  the  bar  at  the  end  of  50  and  100  time  steps.  Curves  1  and  2 
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represent  the  results  at  t  =  50  and  100  ys,  respectively.  Figure  12  shows  the 
axial  stresses  along  the  length  of  the  bar.  We  have  2x2  stations  to  carry  out 
the  numerical  integration  in  the  spatial  direction  in  each  element.  The 
results  for  the  stresses  are  calculated  at  four  integration  stations.  As  can 
be  expected  from  a  long  rod  (L/D  »  25),  the  differences  in  the  results  along 
the  stations  near  the  centerline  or  outside  are  very  small,  figures  13  and  14 
shew  the  similar  results  for  the  effective  stress,  particle  velocity,  and 
axial  stress  along  the  length  of  the  bar  at  t  *  150  and  200  us.  Aa  can  be 
seen  in  these  two  figures,  the  axial  stress  in  the  plastic  zone  shows  bigger 
oscillations  than  corresponding  effective  stress.  Comparing  the  results  shown 
in  Figures  11  through  14  with  the  one-dimensional  results  shown  in  Figures  6 
through  8,  we  conclude  that  the  transition  near  the  wave  front  is  not  as  steep 
as  the  one-dimensional  case  and  dispersion  behind  the  wave  front  can  be 
observed.  Some  of  the  oscillations  are  real  due  to  lateral  effects  such  ae 
radial  inertia,  radial  shear,  etc.,  but  some  are  due  to  numerical  errors  such 
as  truncation  error  in  the  finite  element  system,  approximations  in  time 
integration  schemes,  numerical  integration  in  the  spatial  directions,  etc.  It 
is  difficult  to  identify  how  much  of  the  oscillations  are  real  and  how  much 
are  due  to  numerical  error.  We  hope  to  develop  a  numerical  model  to  minimize 
the  numerical  error  for  this  purpose  while  trying  to  improve  the  theoretical 
model. 

VI  CONCLUSION.  Based  on  our  numerical  study  for  uniaxial  stress  wave 
problem  in  a  linear-hardening  material,  the  central  difference  method  gives 
more  accurate  results  than  the  Wilson  and  Newmark  methods.  The  effect  of 
hardening  is  significant  and  lateral  effects  due  to  radial  motion  needs 
further  study.  We  plan  to  improve  the  theoretical  model  and  to  develop  a 
numerical  scheme.  We  hope  to  compare  our  numerical  results  with  experiments 
involving  normal  impact  of  cylindrical  rode. 
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Figure  9.  The  effect  of  hardening  on  V  and  a  at  t  *  300  pe. 
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I .  INTRODUCTION .  This  investigation  is  concerned  with  the  propagation 
of  shocks  and  phase  boundaries  in  elastic  solids.  Attention  is  restricted  to 
one-dimensional  motions?  for  simplicity,  imagine  a  bar  in  which  transverse 
displacements  are  absent.  The  problem  is  introduced  in  section  II,  where  it 
is  reviewed  how  change  of  phase  phenomena  can  be  modelled  by  means  of  a 
nonmonotonic  stress-strain  law.  These  laws  have  been  studied  previously  in 
[1]/  [2],  [3];  a  relevant  experimental  study  is  [4].  This  section  also 
treats  the  simple  wave  that  develops  whenever  a  nonzero  load  ob  is  suddenly 
applied  to  the  end  of  the  bar.  This  simple  wave  would  be  expected  to  mirror 
in  some  tashion  the  ultimate  state  of  affairs  whenever  the  bar  is  gradually 
loaded  at  one  end  to  the  level  a provided  waves  are  not  subsequently 
reflected  back  from  the  opposite  end.  Issues  involved  in  such  an  asymptotic 
study  are  discussed  in  the  third  section.  Section  IV  addresses  special 
considerations  for  materials  in  which  the  stress-strain  law  is  piecewise 
linear.  Then,  in  section  V,  we  carry  cut  an  asymptotic  analysis  for  an 
example  problem  involving  such  a  material* 


II.  FORMULATION  OF  THE  PPOSLF:.  AND  THE  RIEMANN  SOLUTION  FOR  IMPTJLSIVE 
LOADING.  Consider  a  homogeneous,  semi-infinite  elastic  bar  which  occupies 
x  >  0  in  a  reference  configuration.  Pure  longitudinal  motion  is  governed  by 
the  momentum  equation 


(2.1) 


32u  3a 
3t2 


where  \i  =  u(x,t)  is  the  longitudinal  displacement  of  the  bar  and  a  is  the 
stress  alone,  the  axis  of  the  bar.  The  density  in  the  reference  state  is  taken 
to  be  one.  Let 


(2.2) 


denote  respectively  the  strain  and  velocity  in  the  bar.  For  elastic 
materials,  the  stress  at  time  t  at  a  location' which  was  originally  at 
position  x  is  completely  determined  by  the  value  of  e(x,t)  by  meanp  of  the 
constitutive  relation  a  =  a(e)  .  The  sound  speed  of  the  material  can  be 
identified  by  expressing  (2.1)  in  characteristic  form  and  is  found  to  be 

/a'(£)  .  Here  the  *  symbol  is  the  usual  shorthand  notation  for 
derivative.  We  shall  focus  attention  on  a  hypothetical  material  for  which  the 
stress-strain  law  a(e)  is  given  by  the  smooth  curve  in  Pig.  1.  Here 
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0(0)  ■  C  and  we  shall  restrict  attention  to  positive  values  of.  a  and  e. 
This  entails  no  loss  in  generality  since  compressive  motions  can  be  treated  by 
considering  a  corresponding  e.ctensional  problem  for  a  material  with  a  stress- 
strain  curve  found  by  reflecting  the  original  curte  through  the  origin. 

Unlike  the  ctate  of  affairs  in  gas  dynamics,  this  technique  gives  correct 
results  e*ren  in  the  presence  of  shocks  t5]  . 


Fig,  1.  Stress-strain  curve 


Restricted  stress-strain  curve 
indicating  the  jump  associated 
with  a  change  of  phase 


The  descending  portion  of  the  curve  in  Fig.  1  is  its  most  conspicuous 
feature.  Tu  an  equilibrium  setting,  strains  associated  with  this  portion  are 
found  to  be  unstable  [  1  ] .  Ir.  the  nonequilibrium  case  the  sound  speed  is 
imaginary  and  the  equation  of  motion  is  elliptic  at  these  strains.  These 
difficulties  can  be  overcome  by  precluding  these  strains.  This  is 
accomplished  in  a  natural  fashion  by  considering  n  inverse  to  the  o(e) 
relation,  namely  strain  as  a  furction  of  stress.  Since  the  o(e)  relation  is 
not  -lonotonic,  there  ate  innumerable  such  inverse  functions.  We  take  the 
particular  inverse  that  leads  to  the  clipped  or  restricted  curve  as  shown  in 
Fig.  2.  The  separated  strain  intervals  are  now  associated  with  different 
material  phases.  Hero  the  value  of  the  transit!' n  stress  0^  is  assumed  to 

be  a  property  of  the  material  (see  (*•])<  Several  other  methods  for  dealing 
with  non-monotonic  constitutive  laws  have  also  been  studied  (see  f6],  [7], 
18]). 

If  a  load  oQ ( t '  >  0  is  applied  to  the  and  of  the  bar  beginning  >.t 
time  t  »  0,  (2.1)  is  to  be  so?\ed  subject  to 


(2.3) 


(2.4) 


3(e(0,t) }  «  o0  (t)  , 


t  »  C  , 


e(x,0)  *  0  , 


v(x,("  *  0, 


x  >  c 
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The  condition  (2 
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has  solutions  £  for  each  value  of  X  on  the  interval  [/o' (e^) ,  /i' (0)J . 

Whenever  3(e)  contains  line  segments,  (2.9)  will  have  an  interval  of 

solutions  in  e  for  certain  values  of  X,  say  X  ^  X,  ...  X  .  At  all  other 

■  n 

values  of  X.  (2.9)  uniquely  determines  a  function  s(X).  Next,  e(X)  is 


and  e(X)  =  e  for 


This  construction  orders  the  sound  speeds  of  the  rays  x  =  Xt  in  such  a 
manner  that  values  of  strain  associated  with  higher  sound  speeds  are  located 
further  down  the  oar  compared  to  those  strain  values  vith  lower  sound 
speeds.  In  the  process  it  naturally  positions  shock  and  phase  boundary  curves 
x  =  X^t,  as  well  as  ensuring  the  correct  values  of  e  on  the  x  and  t 

axes.  By  virtue  of  (2.1),  (2.7),  the  function  v(X)  is  found  from  e(X) 
through 

0  X  >  X„ 


(2.10)  v( X )  =  _  s (X ) 

v(Xj)  -  /  /o' ( s )  ds 


e(X-) 


Xi+,  <  X  <  \  , 


(i  —  1  ...n,  X  .  —  0 ) 
;>+1 


where 


v(Xr)  =  v(.\j)  +  XjeUj)  -  e(Xj)] 


The  solution  (2.8)  of  the  impulsive  load  problem  thus  consists  of  a 
partitioning  of  the  quadrant  x  >  0,  t  >  0  into  sectors  by  rays  through  the 
origin.  Across  these  rays  the  solution  may  be  either  smooth  or 
discontinuous.  In  the  sectors,  the  strain  and  velocity  fields  are  either 
constant  or  arise  from  solutions  of  (2.9).  In  both  :ases  it  can  be  shown 
that  at  least  one  of  the  two  Riemar.n  invariants  associated  with  (2.1) 
remains  constant.  This  being  the  case,  the  solution  is  said  to  be  a  simple 
wave. 


It  is  worth  mentioning  that  if  at  some  time  t^  >  0  the  bar  is 

subsequently  impulsively  unloaded  back  to  a  =  0  ,  the  wave  pattern  /hich 
results  v  oefore  any  interactions  with  the  loading  waves  whi-h  at  t-j  are 
further  down  the  bar)  can  be  found  by  an  analogous  construction  involving 
lower  convex  envelopes  of  0(e)  . 

III.  NONIMPULSIVE  LOADING.  Whenever  the  load  ap(t)  is  not  simply  a 

positive  constant,  the  solution  of  the  system  (2.1)  -  (2.7)  is  not  a  simple 
wave  similarity  solution  like  that  of  the  previous  section.  Instead  one  must 
take  account  of  both  families  of  characteristics  associated  with  (2.1). 
Suppose,  however,  that  the  applied  load  eventually  .> '.tains  or  merely 
approaches  to  final  value  .  In  this  event,  one  expects  the  corresponding 

simple  wave  solution  with  0Q(t)  =  am  to  give  the  ultimate  number  of  shocks 


V.  *  u 


i  A.  *  fL 
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and  phase  boundaries,  and  to  yield  the  correct  limiting  order  of  the  waver  in 
the  bar  as  t  tends  to  infinity.  The  spacing  of  these  waves  will  depend  on 
the  manner  in  which  oQ(t)  ■*  am,  nevertheless  the  value  of  the  dynamical 

fields  in  the  limit  A  =x/t  fixed,  t  ♦  09  is  given  by  the  simple  wave 

solution . 


The  approach  to  the  asymptotic  state  may  be  studied  by  decomposing  the 
displacement  u  into  the  corresponding  simple  wave  solution,  and  a  correction 

to  be  denoted  by  u  .  By  virtue  of  (2.8),  (2.2)  the  former  is  expressed 

A 

uQ(A)t;  the  latter  u  is  assumed  to  be  o(t)  as  t  ♦  X  fixed. 

Thus 

A 

(3.1)  u  =  uQ(A)t  +  u(A,t)  , 

while  the  governing  equation  (2.1)  becomes 


,  ,  2A  A"  2A  A  .,1x1 

<3-3)  3  ux  +  3  u«  '  -  +  utt  •  "  It  "x>  75  " 


AA 


Here  subscripts  of  A  and  t  denote  partial  differentiation.  Entering 
(3.2)  with  (3.1)  one  obtains  to  leading  and  second  order 


(3.3) 


0  = 


+ 


*'<uJ)]J 


+  terms  higher  than  second  order. 


In  arriving  at  the  above  result  it  is  necessary  to  make  use  of  the  expansion 

a'(T  V  ~  o’(u’)  +  a-(u')  {ux. 

The  expression  in  the  last  parenthesis  of  (3.3)  dominates  all  succeeding 
expressions  for  large  times  and  so  must  independently  vanish.  Thus  one  draws 

(3.4)  u&(A)  =  ciX  +  c2 
for  constants  c^  ,  C2  »  or 

(3.5)  uq(A)  =  JX  c’"1(s2)ds 

where  the  superscript  ”1  denotes  functional  inverse.  It  is  easily  verified 
that  (3.4)  gives  rise  to  the  constant  strain  and  velocity  fields  of  the 
impulsive  load  problem,  while  (3.5)  corresponds  to  solutions  of  (2.9).  The 
Riemann  solution  discussed  in  the  previous  section,  is  the  unique  way  in  which 
it  is  possible  to  assemble  these  solutions  '3.4)  and  (3.5)  in  a  manner 
which  satisfies  (2.4),  (2.7),  oQ(t)  =  o M  and  also  jumps  over  the  specified 
interval  (a,t$). 
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The  nature  of  the  approach  to  this  simple  wave  is  governed  by  the 
expression  in  the  second  parenthesis  of  (3.3).  Requiring  this  expression  to 
vanish  leads  to 


(3.6)  -2A  u,  +  t  u..  =  0 

At  tt 

in  the  case  of  (3.5),  while  for  (3.4)  it  is  easiest  to  revert  back  to 
independent  variables  x  and  t  whereupon  one  finds  that  the  equation  can  be 
written 


u  =  0 '  ( c  )u 
tt  1  xx 


The  latter  has  the  familiar  solution 


(3.7)  u  =  a(x  -  /o’  (c.j)  tj  +  B^x  +  /oMc^  tj  , 

for  any  functions  A(  • )  ar.d  B(  • ) .  The  former  (3.7)  is  also  easily  solved 

(3.8)  u  =  —  C(/X  t)  +  D(A)  =  t  C(xt)  +  D ( x/t ) 

/A 

for  any  functions  C ( * )  and  D(*).  Unfortunately  the  boundary  condition 
(2.3)  ca..not  be  used  directly  for  determining  the  free  functions  which  have 
emerged  from  this  treatment.  Inr'-ead  one  anticipates  appropriate  conditions 
to  arise  from  a  more  penetrating  analysis  of  various  short-  and  intermediate  - 
time  solutions  to  the  problem,  each  of  which  is  appropriate  in  a  different 
region  of  the  (x,t)  -  quadrant.  In  general  one  expects  to  continue  the  fields 
between  these  as  yet  unknown  regions  through  a  detailed  matching  layer 
analysis.  Rather  than  pursuing  this  program,  a  method  appropriate  to  certain 
special  materials  will  be  introduced. 


IV.  PIECEWISE  LINEAR  STRESS  RESPONSE  We  turn  now  to  a  class  of  model 
materials  in  which  the  stress-strain  curve  consists  of  a  number  of  linear 
segments.  Such  models  are  often  associated  with  the  theory  of  plasticity, 
nevertheless  here  we  shall  assume  that  loading  and  unloading  follow  the  same 
stress-strain  curve. 


The  fields  arising  from  an  impulsive  load  can  be  found  by  the  procedure 
discussed  in  section  2.  As  before,  each  line  segment  in  the  upper  convex 

a 

envelope  o(e)  of  0(e)  is  associated  with  a  front  across  which  the  strain 

A 

suffers  a  discontinuity.  For  the  materials  now  under  consideration,  0(e) 
will  consist  of  nothing  but  line  segments.  It  is  convenient  to  distinguish 
among  three  types  of  discontinuity  fronts  by  drawing  a  distinction  between  the 

A 

line  segments  comprising  o(e).  We  shall  say 

a  phase  ooundary  is  a  front  which  is  associated  with  a  line  segment 

which  spans  a  clipped  portion  of  the  original  curve 
0(e)  , 


a  contact  discontinuit 


is  a  front  which  is  associated  with  a  line  segment 
which  coincides  with  the  original  curve  0(e),  and 
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a  shock  is  a  front  which  is  associa  ed  with  a  line  segment 

which  neither  coincides  with  the  original  curve,  nor 
spans  a  clipped  portion  of  the  curve. 

The  solution  of  the  impulsive  load  problem  will  consist  of  partitioning  of 
the  (>:,t) -quadrant  into  sectors  of  constant  strain  and  velocity.  These 
sectors  are  separated  by  either  shocks,  phase  boundaries  or  contact 
discontinuities . 

We  now  inquire  as  to  what  is  the  relation  between  this  simple  wave 
solution  and  the  fields  which  would  be  produced  if  the  bar  were  gradually 
loaded  to  the  level  o^?  As  before,  the  simple  wave  solution  will  yield  the 
limitincr  order  of  waves  in  the  bar  as  t  tends  to  infinity.  It  will  also 
produce  Lhe  correct  final  number  and  ordering  of  shocks  and  phase 
boundaries.  The  contact  discontinuities,  however,  are  not  associated  with 
true  curves  of  discontinuity  in  the  field  variables.  Instead  they  indicate 
nondispersive  wave  packets  across  which  the  dynamical  fields  gradually  change. 

The  approach  to  this  simple  wave  can  be  examined  by  exploiting  the 
linearity  of  the  material  in  the  different  strain  intervals.  The  difficulty 
lies  in  determining  the  regions  in  the  (x,t) -plane  in  which  the  strain  takes 
values  in  the  individual  intervals.  The  boundaries  of  these  curves  must  be 
either  curves  of  constant  strain  or  curves  acrosj  which  the  strain  jumps 
between  values  from  different  intervals.  In  either  case  these  conditions  lead 
to  functional  equations  when  the  strains  are  expressed  in  terms  of 
D'Alembert’s  solution  to  the  linear  wave  equation. 

In  order  to  illustrate  this,  we  consider  the  material  of  Fig.  5.  The 
particular  form  of  this  stress-strain  curve  is  motivated  by  its  similarity  to 
the  material  previously  introduced  in  Fig.  1.  W'>  shall  suppose  that 

lim  aQ(t)  a  wh^re  am  is  as  indicated  m  Fig.  5.  The  corresponding 

t-K» 

impulsive  load  solution  is  also  given  in  Fig.  5.  This  solution  consists  of  a 
phase  boundary  and  a  contact  discontinuity  separating  regions  in  which  the 
dynamical  fields  are  constant.  Suppose  further  that 

0Q(U)  =  0  ,  o0’(t)  >  0  . 

We  shall  let  x  =  r(t)  denote  the  phase  boundary,  while  x  =  q(t)  shall 
denote  the  curve  upon  which  the  strain  has  value  y,  thus 

(4.1)  e(q(t),tj  =y. 

Let  Aq,  A ^ ,  A2  denote  the  regions  in  the  (x,t) -plane  in  which  the 
strain  lies  in  the  respective  intervals  (0,a),  (6,y),  (u,0*).  In  each 

32e  2  32e 

region  A;  the  strain  obeys  the  equation  — r  -  c  — —  which  implies  that 

3tZ  1  3xZ 

(4.2)  e(x,t)  =  fi(x  +  cit^  +  g^-x  +  cit) 

for  as  yet  undetermined  functions  f  a<~d  g.  The  velocity  in  A,  must  then 
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be  v(x,t)  =  c^f^(x  +c^t)  -  c^g^(-x  +  c^t) .  The  and  g^,  along  with 

s(t)  and  q(  t)  /  are  eight  unknown  functions,  each  of  a  single,  argument. 

They  must  satisfy  eight  conditions  given  by:  the  two  initial  conditions 
(2.4),  the  boundary  condition  (2.3),  the  two  shock  conditions  (2.7),  two 
conditions  stemming  from  (4.1),  and  a  condition  that  the  velocity  is 
continuous  across  x  =  q(t).  The  last  three  conditions  in  the  above  list  give 
rise  to  functional  equations  since  q(t)  appears  as  an  argument  of  f-j,  f2, 

• 

,  g2*  Moreover,  since  s(t)  appears  in  (2.7),  the  condition  on 

x  -  s(t)  furnishes  a  pair  of  functional  differential  equations,  in  the 
expressions  which  follow,  we  shall  employ  parenthesis  solely  to  indicate  the 
argument  of  a  function. 

From  the  initial  conditions  (2.4)  it  can  be  shown  that  the  Riemann 
irvar.iant  associated  with  the  characteristics  —■  =  -c.  is  identically  zero 

ut  v) 

on  Ag.  Thus  fg(z)  H  0  which,  by  virtue  cf  (4.2),  (2.5),  yields 

ec(z/c0)  7.  >  0  , 

Vz)  - 

0  z  <  0  . 

The  other  six  unknown  functions  do  not  admit  rimple  solution 
representations.  In  what  follows  we  shall  restrict  attention  to  large 
times.  The  analysis  will  be  shortened  considerably  by  taking  advantage  of  the 
simple  wave  solution  depicted  in  Fig.  5.  This  is  not  necessary;  the 
asymptotic  simple  wave  could  be  deduced  from  the  analysis.  Such  a  program, 
however,  requires  the  consideration  -  too  lengthy  to  be  included  here  -  of 
numerous  possible  cases. 

V.  LARGE-TIME  DYNAMICAL  FIELDS  The  boundary  condition  (2.5)  can  be 
incorporated  directly  into  the  functions  f2,  g2*  In  placr  of  the  functions 

f),  g1;  f2,  g2  we  shall  instead  employ  i  ,  h  through  the  expressions 


(5.1; 


(5.2) 


e(x  t)  * 


v(x ,  t )  = 


f  (  X  + 

Cjt)  + 

g(-x  +  c^t) 

in 

A1 

h(x  + 

c2t)  - 

h(-x  +  c2t)  +  eQ(t  -  x/c2) 

in 

A2 

c^f(x 

+  C^t) 

-  c^g{-x  +  c^t) 

in 

A1 

c2h(x 

+  c2t) 

+  c2h(-x  +  c2t)  -  c2eQ(t  -  x/c2) 

in 

A2 

The  five  unknowns  f,  g,  h,  s,  q  are  to  be  detrmined  from:  two  conditions 
holdi  g  on  x  =  q(t)  which  stem  from  (4.1),  a  condition  expressing  the 
continuity  of  velocity  on  x  =  q(t),  and  the  two  shock  conditions  (2.7) 
which  hold  on  x  =  s(t).  The  lirst  two  of  the  conditions  lidding  on  x  * 
q(t)  become 

(5.3)  f(q(t)  +  c1t)  +  gi-q(t)  +  c^t)  -  U  , 

(5.4)  h(q( t)  +  c2t)  -  h(-q(t)  +  c2t)  +  eQ(t  -  q(t)J  «  U  . 
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With  the  aid  of  these  two  equations,  the  continuity  of  velocity  condition  may 
be  written  as 


c0  c 

(5.5)  f(q(t)  +  CntJ  =  ~  h[q(t)  +  C2t)  +  1  (1  -  -±  Jy 


The  two  shock  conditions  can  be  manipulated  into  the  form 


D1 

(5.6)  |c1  +  s(t)]f(s(t)  +  c^tj  +  =  0  ,  t  sufficiently  large. 


D1 

(5.7)  [ c^  -  s(t)]g(-s(  )  +  cifc)  +  2c~  =  9  '  sufficiently  large  . 


The  phrase  "t  sufficiently  large"  indicates  that  these  are  the  equations 
which  hold  once  the  phase  boundary  x  =  s(t)  has  become  the  leading 
disturbance.  This  eventuality  follows  from  the  simple  wave  sol  tion,  moreover 


the  simple  wave  solution  also  indicates  that  s(t)  /o(u)/u  t  and 


q(t)  c 


Assume  that  the  wave  and  front  speeds  obey  the  ordering 


(5.8) 


c1  >  s(t)  >  c2  >  q( c)  >  0 


for  t  sufficiently  large.  One  may  conclude  directly  from  (5.3)  -  (5.7) 
that 


f(z)  *  fw  ,  g(z)  ,  h(z)  -*•  h.o 


as  z  “  , 


:(t)  a  ,  q(t)  +  3 


as  t  00 


The  values  for  f  .  g  .  and  a  are  obtained  from  a  consideration  of  the 

CO  •'00 ' 


equations  (5.3),  (5.6),  (5.7)  for  large  times.  As  t  •*■  «*  these  equations 
become  respectively 


D1  D1 

f«»  H  =  11  '  [c-j  +  +  2^  =  °  '  [C1  "  a]g»  +  ~2c~^ 


which  give 


rR  Q,  _  /2  D1  /Q(W)  . 

“  *9)  “  ^C1  +  u  ^  y  '  f®  2^1^  +  ' 


9»  2c-jtci  "  ’ 


The  values 


(5.10) 


h  =  —  f  -  ~  [-—  -  lju  =  5  li[l  -  t~]  ,  3  =  c  , 

00  00  2  Lc^  J  2  L  c  J  2 
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K* 

Si 

Pj 

li 

follow  from  (5.4),  (5.5) 

If 

it  is  immediate  that  Aq, 

li 

solution  displayed  in  Fig 

P 

into  (5.1),  (5.2),  the  1 

E" 

are  recovered. 

t! 


if- 


In  order  to  study  the  approach  to  these  values  let 


3  (  t  >  =  +  S^t) 


q(t.)  =  C2t  +  q^t) 


(5.11) 


f(z)  =  f»  + 


giz)  =  +  91  (2) 

h(z)  =  h  +  h. (z) 


for  new  functions  s^,  q-j,  f-j,  g-j ,  h-j.  Upon  substituting  from  these 

expressions  into  '5.3)  -  (5.7)  and  considering  a  balance  of  the  second  order 
terms,  one  obtains  asymptotic  expressions  for  these  functions.  This  procedure 
applied  to  (5.6)  yields 


L»1 

0  =  [c(  +  a  +  S^t)]  {f„  +  f1([c1  +  o]t  +  s^t)}}  +  — 
=  f  1  ( t )  +  [c1  +  a  +  s1(t)]f1([c1  +  a)t  +  s^t)) 

-  s ^ ( t ) fm  +  [c„  +  a] f 1 ( [c1  +  alt)  , 
which  in  iurn  implies 

(5.12) 

Similarly  (5.')  leads  to 

(5.13) 


[c  +  a] 

s^t)  - - £ -  ^1^c1  + 


tc1  " 

s  (t) - - -  g,i  tc1  -  alt)  , 


while  (5.3)  leads  to 

(5.14)  f^tc,  +  c2lt)~  -  g^ic,  -  c2]t) 


i*’ 


L 


L 

1- 

■> 


l 

& 


CI  WT7  ■^VTV'*'.’  rV  V 


obeys 


’1 


and  s-j 

between 

(5.12 

)  (5.13) 

(5 

I2' 

.v 

"c1  +  0" 

2f  (\ 

r°,  +  “i 

\ 

C2 

7 

c1  -  a 

\ 

I  ci  -  a 

L_ 

as  2  +  *» 


f1(k2z)  ~  k1f1<z) 


This  condition  will  be  written 
(5.15) 
where 

12 


as  z  +  “ 


(5.16) 


L1 


r°)  -  ai 

c  +  a 

i 

L  1  J 

(5 

.8)  that 

tc!  +  «][<=!  "  °2] 

k2  =  F7“  aj'['c,  +  c2J  * 


k_  >  1  >  k.  >  0  . 

C.  I 


(5.17) 

The  asymptotic  expression  (5.15)  indicates  that 

(5.18)  f^(z)  ~  Az11  ,  n  =  In  k1/ln  k2 


where  A  is  undetermined.  In  light  of  (5.17),  it  follows  that  n  <  0  so 
that  f  1  ( z)  is  indeed  dominated  by  whenever  z  +  »,  as  was  assumed  in 


the  development  leading  to  (5.15).  Moreover  the  following  argument 
demonstrates  that 


(5.19) 


n  <  -2  . 


Proof:  (5.18)  implies  (k^2k2)n  =  k^  ^2T‘,  which#  in  conjunction  with 


V? 

<  k/2 


k_  = 


c  —  c 
1  5 


.  ...  -  <1  and  n  <  0  yields  k. 

1  2  c.  ^  c  1 


k.1  +^n  >  1. 


This  last 


'1  ~2 

result,  along  with  (5.17),  yields  1  +  V2  n  <  0,  which  is  (5.19).  / 


One  finds  from  (5.18),  (5.12),  (5.13)  that 


-ll+SJ  Ic,  +  aln+1 

(5.20)  g.j(z)'-  k1  Az  '  ,  s^t) - j -  At 


On  account  of  (5.20)  and  (5.19)  one  has 


(5.21) 

where  s0  is  undetermined. 


n+1 


s1(t)  ~  SQ  "  (n+1 )f 


At 


n+1 
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The  asymptotic  behavior  of  the  corrections  h^fz)  and  g-j(t)  are  found 

from  the  remaining  equations  (5.4),  (5.5).  Entering  (5.5)  with  (5.11), 
(5.10)  one  obtains 


hi (2°2t  +  q.,^)  -  f1Uc1  +  °23t  +  q1(t>) 


from  which  it  follows  that 


m 


(5.22) 


c  c  +  c  n 

h(z)^J.{  - ]  ^  . 

i  c2  *C2 


Equation  (5.4)  is  the  most  delicate  in  the  analysis.  The  exact 
equation  yields 

(5.23)  V  -  hw  +  h1(2c2t  +  q1  (t)  -  h(~q,j(t))  +  eQ(-q1  (t )/c2)  . 

In  this  equation  h1(2c2t  +  q^tt))  =  o(1)  as  t  +  *».  It  is,  however,  not 
appropriate  to  expand  h(-g^(t))  by  means  of  (5.11)  unless  q^(t)  + 
Assume  for  the  moment  that  q, (t)  +  -**j  then  e0(-q^ (t)/c2)  +  ew, 
h(-q.j(t)}  +  h^  so  that  (5.22)  gives  up  *  y.  Since  £,,,>11  the 
assumption  is  tal.-e.  Instead  (5.23)  demands  that  q1(t)  +  q0  >  with 
Vi  -  h^,  -  hf -qQ)  +  e0(-q0/c2).  In  the  event  that  h(z)  admits  a  Taylor 
expansion  at  z  -  -qg,  then  (5.22),  (5.23)  yields 


+  “  S("q0/c2^  t"q(t)  +  q0^  “  hi(2c2t)  -  [c1/c2](c1  +  c2lnAtn  . 

This  in  turn  Implies  that  q(t)  ~  qg  +  0(tn)  wr.enever  the  coefficient  in 
parenthesis  is  not  zero.  Collecting  the  results  from  this  section  we  have 


f(z)  =  f  +  Azn  +  o(zn) 

w> 


(5,24) 


g(z)  =  gw  +  Ak1 


-<1  +  f)  n 

z  +  o(z  ) 


1  c  ^  +  a 


s(t)  •=  at  +  s. 


n+1 


(n+1 )f 


n+1  ..n+1 

At  +  oft  ) 


h(z)  =  h_  +  A 


C1[C1  *  C2] 
_n  n+1 
2  °2 


q(t)  =  c2t  +  qQ  +  Oft  ) 


n  ,  .  n, 

z  +  o(z  ) 


f 


y* 


‘i 


The  constants  £tt,  h^,  gM,  a,  k^,  n  are  given  in  (5.9),  (5.10),  (5.16), 
(5.18).  The  constants  A,  qQ,  sg  are  as  yet  unspecified.  Indeed,  in  order 
to  assign  values  to  A,  q0,  s0  one  may  surmise  that  it  is  necessary  to  take 
account  of  the  complete  loading  history  c^ft).  Consider,  for  example,  the 
constant  Sg.  The  phase  boundary  x  =  s(t)  asymptotically  approaches  the 
line  x  =  at  +  3^.  This  line  issues  from  the  t  axis  at  time  t  =  -s^/a,  s 
that  Sg  defines  a  time  scale  for  the  problem.  The  only  source  of  such  a 
time  scale  lies  in  the  applied  load  a^(t). 

The  dynamical  fields  are  obtained  by  substituting  from  (5.24)  into 
(5.1),  (5.2).  This  yields 


Ac  (c  +  c  ] 

s(x,t)  ~  £  +  - - - -  ------ 

r  n+1 

2 


v(x,t)  ~  -  ay  -  c ,[ea  -  yj 


[c2  +  f  ]  -  [c2  -  f  ]  tn  , 


.  Scllc1  +  c2]  ,  .  X  , 

*  — -T-Si-  lc2  *  t  !  ••  l' 

2  c2 


-f  J  tn 


in  region  A2«  In  region  A-j  one  obtains 


e(x,C)  ~ 


~i 

c.  +  a  n+2  n 

♦  r  r  ♦  i  (.,-fi^t". 


v(x,t.)  ~  -ay  + 


n  c  +  a  n+“  n 

nc,  (c,  +  i  J  -  (J -  ]  [c,  -  i  ]  tn 

1  1  1  t  J  Lc1  -  a  1  L  1  t  J 


Although  the  value  of  A  is  not  found  from  this  analysis,  the  exponent  a 
governing  the  rate  approach  to  the  simple  wave  is  determined  from  both  the 
stress-strain  behavior  of  the  material  and  the  ultimate  level  of  the  applied 
load  by  means  of  (5.18). 


to; 
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ABSTRACT.  When  a  two-dimensional  elastic  body  which  contains  a  notch  or 
a  crack  is  nnder  a  plane  stress  or  plane  strain  deformation,  the  asymptotic 
solution  ot  the  stress  near  the  apex  of  the  notch  or  crack  is  simply  a  series 
ot  eigenfunctions  of  the  torsi  ?®f (^,8)  in  which  (p,^)  is  the  polar  eoordi-. 
nate  with  origin  at  the  apex  and  6  is  the  eigenvalue.  If  the  body  is  a 
three-dimensional  elastic  solid  which  contains  axisymmetric  notches  or  cracks 
and  subjected  to  an  axisymmetric  deformation,  the  eigenfunction  associated 
with  an  eigenvalue  contains  not  only  the  p*>  term,  but  also  the  p®+1, 
P°  ...  terms.  Therefore,  the  second  and  higher  order  terms  of  the  asymp¬ 
totic  solution  are  not  simply  the  second  and  subsequent  eigenfunctions.  We 
present  the  eigenfunctions  for  transversely  isotropic  materials  under  an  axi¬ 
symmetric  deformation.  The  degenerate  case  in  which  the  eigenvalues  p^  and 
P2  of  tlie  elasticity  constants  are  identical  is  also  considered.  The  latter 
includes  the  isotropic  materials  under  axisymmetric  detonations. 


1  INTRODUCTION,  It  is  well-known  that  the  stress  distribution  near  the 
apex  ot  an  isotropic  elastic  wedge  or  a  notch  under  a  plane-stress  or  plane- 
strain  detonation  can  be  expressed  in  tens  of  a  series  of  eigenfunctions  ot 
the  ton  p®f(^,8)  where  p  is  the  radial  distance  from  the  apex  and  f  is 
a  function  of  the  polar  angle  f  and  the  eigenvalue  6,  ri,2].  For  given 
wedge  angle  and  homogeneous  boundary  conditions  at  the  sides  of  the  wedge, 
there  are  in  general  infinitely  nany  eigenvalues  6  and  the  associated  eigen¬ 
functions  p®f(^,6).  Particularly  iaqportant  in  applications  is  when  one  or 
more  ot  the  6's  is  negative  and  the  stress  is  singular  at  the  apex.  For 
instance,  if  the  specimen  shown  in  Fig.  1  represents  a  two-dimensional  body 
under  an  external  loading,  6  =  -1/2  at  the  crack  tip  Q.  At  points  R,  N 
and  M,  it  can  be  shown  [3J  that  there  are  two  negative  6's.  Hence  the 
stress  is  singular  at  points  Q,  R,  N  and  M.  In  solving  stress  distribution 
in  the  entire  specimen  numerically  by  a  finite  element  scheme,  one  may  use 

regular  finite  elements  everywhere  except  at  the  singular  points  Q,  R,  N  and 

M.  At  these  singular  points,  a  special  element  is  used  in  vnich  the  singular 
nature  of  the  stress  is  given  by  the  analytical  expression  p*f(^,6).  It  may 

be  sufficient  to  consider  only  the  first  term  (or  terms)  for  which  &  is 

negative  in  the  special  element.  In  many  cases,  however,  more  terms  including 
those  associated  with  positive  6  are  required  [4,5]. 
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In  this  paper,  we  consider  the  case  in  which  the  speeiaen  shown  in  F?g. 
1  is  a  cross  section  of  an  axisyaaetic  body  under  an  axisyaaetric  det oraation. 
The  aaterial  is  assumed  to  be  transversely  isotropic  with  the  z-axis  being  the 
axis  of  syaaetry.  The  associated  problea  for  isotropic  aaterials  was  investi¬ 
gated  by  Delate  and  Erdogan  [6],  However,  their  dhjeetives  are  different  froa 
oars  and  hence  their  series  solution  is  different  froa  the  one  presented  here. 
It  is  seen  that  the  eigenfunction  associated  with  an  eigenvalue  6  no  longer 
contains  a  single  tern  P**f(^,&).  It  also  has  the  terns  p6+1f1(^,6), 
p6+2f2 (</',&)...  Therefore,  the  inclusion  of  the  second  and  higher  order  terns 
in  the  special  elenent  is  not  sinply  the  inclusion  of  the  eigenfunctions  asso¬ 
ciated  with  the  subsequent  snallest  eigenvalues  &.  A  ainilar  situation  occurs 
in  wedges  with  curved  sides  under  a  two-dimensional  def oraation  [7] . 


2.  MATHEMATICAL  FORMULATION.  We  choose  s  cylindrical  coordinate  systea 
(r,0,z)  in  the  transversely  isotropic  aediua  such  that  the  z-axis  is  the  axis 
of  naterial  syaaetry.  Let  (ur,ug,uz)  be  the  displaceaent  components.  We 
assuae  that  the  deforaation  is  axisyaaetric  in  which  uq  «  0  while  ur  and 
u  are  functions  of  r  and  z  only.  Introducing  the  displaceaent  potential 
4>(r,z)  from  which  ut  and  u2  are  given  by  [8-10] 


=  * 
r  ox 


u  If 

dz  * 


where  a  is  a  constant  to  be  determined,  the  stresses  have  the  expressions 


'r  =  CU  ^  +  C12  +  C13' 


„  32^  »  d<f>  x  d\ 

0  *  CI2  ^  C11  r0r  c13»  9i2 


_  _  _9£  , 

**  1i  If  r0r  C33«  ^ 


c44(1+m)  -M: 


in  which  Cj,  are  the  elasticity  constants.  The  equations  of  equilibrium  are 
satisfied  ifJ 


it  +  j±_jLi±wc0 

d?  rdt  "  p2  9z2  * 


where 


P2  - 


-c 


11 


c13  +  <1+“)c44 


c13m  +  ^+ffl^c44 


"c33“ 


or 

C11  +  c44p2  c13  +  c44 

— m  =  -  1  1  ■  - -  =  - 

(®13  +  c44)p2  c44  +  c33p2 

The  second  equality  of  Eqs.  (4a)  ard  (4b)  yield*  respectively. 


(4a) 


(4b) 


m2  -  2 


cllc33  ~  c13 
2c44(c13  +  c44} 


(5a) 


2  » 

A  cllc33  c13  "  2c13c44 

p  •  +  2  - 

2c33c44 

It  can  oe  snown  [11]  that  p  cannot  be  real  if  the  strain  energy  is 
positive  definite.  Ve  therefore  have  two  pairs  of  coupler  conjugates  for  p 
denoted  by  p^  pj,  p^  and  P2  where  an  overbar  denotes  the  coaiplex  con¬ 
jugate.  The  associated  m  are  denoted  by  st^,  ^ and  Te  see 

frosi  Eq.  (5a)  that 

m^m j  =  1  (5c) 

Since  Eq.  (5b)  is  a  quadric  in  p2  with  real  coefficients,  if  pj  f*  purely 
iaaginary  so  is  pj .  In  this  case  aj,  are  real  and  bj  =  mi,  *2  ■=  *2 . 

If  p^  and  P2  are  not  purely  iaaginary,  we  may  choose 

Pj  =  u  +  iv  ,  pj  =  -*u  +  iv  *  -pj  (6) 

where  u,  v  are  real.  In  this  case  mx,  ^  tre  complex  and  ai  «  a*.  In 
view  ot  the  fact  that  Eiq.  (3)  is  linear  in  the  general  solution  for  <f>  i* 
obtained  by  superimposing  <f>'t  associated  with  p  =  p1#  P2»  pi  and  pj. 

We  will  assume  that  Pi  £  P2  •  The  degenerate  case  Pi  =  P2  will  be  dis¬ 
cussed  in  Section  4. 


2  11  « 
pz  +  - -  0 

c33 


(5b) 


3.  EIGENFUNCTIONS  FOR  SP-UJ,  P.  Let  (r,z)  =  (a,0)  be  a  singular  point 
which  may  be  the  apex  ot  a  wedge,  a  notch  or  a  crack.  In  this  paper,  we  con¬ 
sider  the  cases  in  which  a  4  0.  Using  the  singular  point  as  the  origin,  we 
define  (Fig.  1) 

x  =  r  -  a  =  p cos^r,  z  =  psin^  .  (7) 

We  assume  that  ^  =  a  and  o'  are  the  free  surfaces.  To  find  the  eigen¬ 

function  for  <f>  which  is  valid  for  small  p,  we  rewrite  Eq.  (3)  as 
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ax2  ’  p2  a*2  #+x  ax  a  3x  •■*  '  a/ 


(8) 


K: 


m 


Let 


4>(k)  c  E  A[k)xtZ6+k-t+2/(6+k-t+2)(l>+k-t+l) 

t=0 


</> 


<k) 


(9a> 


C9b) 


Z  *  x  +  px 

vjieco  8  is  the  eigenvalue  end  *r*  constants  to  be  determined 

Using  Eq.  (7),  we  have 


(10) 


^Svl-t+2  . 
f  =•  coa^f  +  p  sin'/' 


(11) 

(12) 


Hence  </>(k)  is  of  ardex  ph+k+2  BJ  substituting  Eqs.  (9)  into  (8)  and 
equating  the  coefficients  of  xtZ*+k-t+2,  it  can  be  shown  that 


(k)  2k-l  (k-1) 

*k  "  IT  Vl 


(k  i  1) 


,<k)  .  i(k-X>  + _ L_  U 

2t  At"1  2  (8+k-t) 


<w)  -  <«>4S]  • 


(t  »  k-l*  fc-2  ...1  .  k  2  2) 


.(k) 


(13a) 


(13b) 


Hence  the  only  unknowns  are  A0  .  (k  «  0*1*2...)  which  are  determined  from 
the  boundary  conditions  at  ^  ■  u  and  a*. 


Jb.t'“  nplmd  b,  B{«.  CJ" 

tiveiy.  Thus  the  general  solution  for  is 


f«  mill  let  the  solutions  given  by  Eqs.  (9-13)  apply  to  p  «  vX,  For 

w.  A  .  .  h*.  .  ”  -V  «S)  “Vn6  •mv*1”*  K0,pt 
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ant¬ 
is  replaced  by 
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m 

E5g 


t*0  ' 


(14) 


+  D^k^x 


tz|+k_t+2| j (8+k-t+2  )  (8+b-t+D 
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Z,  =  x  +  p,J 


(•  *=  1,2) 


(15) 


4.  D3GFNEBATR  CASE:  aj  =  fc,  s  1.  When  Pi  =  P7«  P  ®ust  be  purely 
imaginary.  This  follow*  from  Eq.  (5b)  and  the  fact  that  p  cannot  be  real. 
By  Eqs.  (4b)  and  (5c),  we  have  ®i  =  =  1.  We  cannot  have  mj  =  ■£  =  -1 
because  this  would  sake  p  real. 

In  a  degenerate  case  Pi  =  Pc,  the  terms  associated  with  Bjj.k^  and  D^k/ 
are  identical,  respectively,  to  the  terms  associated  with  A^*7  and  C£k7. 

We  therefore  need  a  new  solution  for  B^  and  D^k^ .  This  can  be  accom- 
plisned  oy  replacing  the  coefficients  of  B^)  and  by  their  deri¬ 
vatives  with  respect  to  pj  *nd  $2  [12,13],  Thus,  for  instance,  Eq.  (14) 

becomes 


4>(k)  =•  £  |Ajk)xtZ8+k"t+2  +  c[k)xtZ6+k"t+2|y(6+t-t+2)(6+k-t+l) 

V  |B<k)«tZ6+k-t+1  +  D^k)zxtZ8+k-t+1^/(6+k-t+l) 


(16) 


t=0 


where  we  nave  oaitted  the  subscripts  1  and  2  for  Z  and  Z. 


S.  ASYMPTOTIC  SOLUTION  FOB  SMALL  P.  When  boundary  conditions  at  i  -  o 
and  a'  are  imposed,  one  obtains  a  system  of  homogeneous  equations  for  the* 
eigenvalue  &  and  the  associated  eigenvectors 

i(0>  a(0) 


q(0) 

“o 


_u  whose  elements  are 

*o  *  “o  '  C<°>  and  d£°*.  Thus  q^  is  determined  uniquely 
within  an  arbitrary  constant  £  .  One  also  obtains  systems  of  linear  but  non- 
homogeneous  equations  for  A^,  B^,  cik^  and  D|.k^  in  terms 
<IqK  Thus  the  eigenfunction  associated  with  an  eigenvalue  & 


of 

contains 


only  one  arbitrary  constant  £  .  Let  S 


>1,  &2 '  •  v  Be  the  eigenvalues 
£i ,  ^ ,  • . .  be  the 


arranged  in  the  order  of  increasing  magnitude  and  £  =  ?«,... 

associated  arbitrary  constants.  Using  Eqs.  (9a),  (11)  ana  (14),  the  asymp¬ 
totic  solution  for  <j>  for  small  p  can  be  written  as 


<£  =  £i|p6lfi>0(^»Bi)  +  p6l+lfl,l(^»5l>  +  pfl+2fl,2(^»6l;  +  •••} 
+  *2  {p62f2,0(^'*82)  +  p62+lf2,l('I,'82)  +  p62+2f2,2<^*82)  +  •••} 


(17) 


+  ... 


in  which 
sglujrion 


Xi.j  . 

is  given 


are  known  functions  of  .  <f>  and  6$.  The  first  fern  asymptotic 
by  the  term  p  kfi,o(^»Bi)  •  The  second  term  is 

P  x  ^1,1^'Bl)  If  Bj+1  <  and  p  *2, 0^**2)  if  ®i+l  >  Notice  that 
the  eigenfuncti'on  associated  with  an  eigenvalue  5  hs,s  infinitely  many  terms. 
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HIGHLY  VISCOUS  FLUID  FLOW 
IN  A  SPINNING  AND  NUTATING  CYLINDER 


Thorwald  Herbert 

Department  of  Engineering  Science  and  Mechanics 
Virginia  Polytechnic  Institute  and  State  University 
Blacksburg,  Virginia  24061 

ABSTRACT.  Spin-stabilized  projectiles  with  liquid  payloads  can  experi¬ 
ence  a  severe  flight  instability  characterized  by  a  rapid  yaw  angle  growth  and 
a  simultaneous  loss  in  spin  rate.  Laboratory  experiments  and  field  tests  have 
shown  that  this  instability  originates  from  the  internal  fluid  motion  in  the 
range  of  high  viscosity.  Evaluation  of  the  experimental  data  and  analysis  of 
the  equations  for  the  fluid  motion  in  a  spinning  and  nutating  cylinder  suggest 
a  thecret-cal  approach  in  three  major  steps:  (1)  analysis  of  the  steady  vis¬ 
cous  flow  in  an  infinitely  long  cylinder,  (2)  hydrodynamic  stability  analysis 
of  this  basic  flow,  and  (3)  analysis  of  the  end  effects.  The  basic  flow  has 
been  found  in  analytical  form.  At  low  Reynolds  number,  this  flow  agrees  well 
with  computational  results  for  the  center  section  of  a  cylinder  of  aspect 
ratio  4.3.  The  despin  moment  caused  by  this  flow  largely  agrees  with  experi¬ 
mental  data  for  a  wide  range  of  Reynolds  numbers.  Current  work  aims  at  the 
stability  of  this  flow. 

1.  INTRODUCTION.  It  is  well-known  that  spin-stabilized  shells  carrying 
liquid  payloads  can  suffer  dynamical  instability.  For  cylindrical  cavities 
and  low  viscosity  of  the  liquid,  the  instability  due  to  basically  inviscld 
inertial  waves  can  be  predicted  by  the  Stewartson-Wedemeyer  theory  [1,2]. 
This  theory  rests  on  the  boundary- layer  approach  and  is,  therefore,  restricted 
to  the  range  of  sufficiently  large  Reynolds  numbers.  The  Instability  of  cer¬ 
tain  shells  like  the  XM  761  [3,4],  however  escaped  such  a  prediction  and  is 
also  distinguished  in  character  owing  to  the  rapid  loss  in  spin  rate.  Experi¬ 
ments  with  a  full-scale  liquid  cylinder  [5]  and  subsequent  field  tests  |6] 
established  that  this  new  flight  instability  is  most  pronounced  for  liquid- 
fills  of  very  high  viscosity. 

We  conduct  a  theoretical  analysis  of  this  problem  in  order  to  support  the 
ongoing  experiments  and  to  independently  obtain  insight  into  the  anatomy  of 
the  flow  phenomena.  The  initial  steps  of  this  analysis  are  reported  elsewhere 
[7]:  evaluation  of  the  experimental  data  base,  dimensional  analysis,  scaling 
aspects,  governing  equations,  and  discussion  of  various  simplifying  assump¬ 
tions.  Two  observations  in  this  earlier  work  led  to  the  building-block 
approach  discussed  in  the  following.  First,  if  the  despin  (negative  roll) 
moments  (5]  and  void  observations  [8]  are  correlated  with  the  Reynolds  number 
Re,  at  ;east  three  regions  can  be  distinguished.  At  low  Re,  the  despin  moment 
increases  proportional  to  Re,  and  the  void  In  an  incompletely  filled  cylinder 
is  parallel  to  the  spin  axis.  This  suggests  a  simple  fluid  motion  that  is 
essentially  independent  of  the  axial  coordinate,  except  in  the  neighborhood  of 
the  end  walls.  In  a  middle  range  of  Re,  the  despin  moment  assumes  a  maximum, 
and  a  wavy  distortion  of  the  void  seems  to  indicate  a  cellular  structure  of 
the  fluid  motion.  This  cellular  motion  can,  in  principle,  originate  from 
hydrodynamic  instability  of  the  basic  flow  with  respect  to  axially  periodic 
disturbances.  At  still  higher  Reynolds  numbers,  the  despin  moment  decreases 
with  increasing  Re  in  a  manner  not  clearly  defined  by  the  few  available  data 
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V,  =  vc  +  V,,  P„  =  P  +  P . 
~n  ~s  ~d  n  s  a 


where  ys,P-  describe  the  state  of  pure  solid  body  rotation,  whereas  Vd,Pd 
represent  the  deviation  from  solid  body  rotation.  The  advantage  of  this  iso- 
1.U  '!  yiew  on  the  deviation  is  obvious:  Vd  and  the  reduced  pressure  Pd  are 
re-.^/nsible  for  the  observed  flight  instability.  A  glance  at  the  equations 
sh-js/s  that  Vd=0  and  Pd=0  if  either  one  of  the  following  conditions  is  satis¬ 
fied:  (o=0,  n=0,  e=0  or  (solid  fill). 

The  equations  for  Vd.  Pd  are  then  written  in  terms  of  nondimensional 
quantities  Vj,  p..  We  use  a,  u>,  and  p  for  scaling  length,  time,  and  mass. 
Note  that  this  choice  is  ambiguous  [71  and  excludes  the  case  to=0  (which  lacks 
practical  interest).  The  problem  then  depends  on  four  nondimensional 
parameters: 


x  =  c/a 
a  =  sine 

T  =  fl/co 

Re  =  puia2/u 


aspect  ratio 

f  requency 
Reynolds  number. 


The  aspect  ratio  enters  the  solution  only  through  the  boundary  conditions.  The 
boundary  conditions  on  yd  are  homogeneous. 

In  cylindrical  coordinates  r,$,z,  the  equations  for  the  nondimensional 
deviation  velocity  yd  =  (vr,v^,v2)  and  pressure  pc,  take  the  form 

1 1-  (rv  )  +  +  (2a) 

v  ay*  V  y*  *  v*  \  ' 


where 


D'vr  -  ^  -  2(1  +  ,2)  %  .  2%»z 

=  +  1_  |r,v  .V.2_f\,. 

sr  +  Re  lc  vr  r2  84  > 

D'%  2(1  +  Tz>  vr  -  2trvz 

1  8pd  1  V4,  ?  3Vr 

r3i  Re  1  4>  2  2  3<j>  J 


D'v  +  2t  v 
z  r  <p 


2Vr  ■  -  W  -  2rV  +  fe  D"vi 


Did  d  d  d 

3t  3(f)  r  3r  r  341  z  sz 

nil  =  aL  U  1  5_  +  L  ii-  +  ii_ 

r  8r  r2  34.2  2Z2 
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t  =  -ecoS(j),  =  ssin*,  t  =  xcose,  e  =  tsine  (3) 

r  9  z 


^-periodic  force  term 
If  this  term  vanishes 

For 


The  primary  effect  of  nutation  is  contained  in  the 
-2r?r  =  2ercos<j>  in  the  z-momentum  equation  (2d), 
throughout,  e=0,  equations  (2)  support  a  trivial  solution  yd=0,  pd=0. 
sufficiently  small  e*0,  it  is  obvious  that  the  deviation  velocity  is  of  order 
0(e).  In  the  situations  of  practical  interest,  e  =  (o/oj)sin9  turns  out  to  be 
a  rather  small  parameter  Even  a  conservative  estimate  with  a  <  500  rpm, 
w  >  3000  rpm,  and  9  <  20u  provide:  values  of  e  <  0.057.  Consequently,  it 
seems  well  justified  to  linearize  the  equations  in  e.  This  linearization 
imposes  no  restriction  on  the  Reynolds  number. 

3.  THE  BASIC  FLOW.  The  system  of  equations  aft'ir  linearization  is  still 
quite  difficult  to  solve.  Any  serious  attempt  to  satisfy  all  boundary  condi¬ 
tions  leads  directly  to  a  purely  computational  approach.  Use  of  the  boundary- 
layer  approximation  would  simplify  the  task  but  seems  inappropriate  in  the 
interesting  range  of  low  Reynolds  numbers.  Recalling  that  the  flew  in  a  rela¬ 
tively  long  cylinder  (aspect  ratio  x=4.3)  at  low  Re  exhibits  little  axial 
variation  over  much  of  the  cylinder  length  [71,  we  have  relaxed  the  boundary 
conditions  at  the  end  walls.  As  a  first  step,  we  seek  for  a  steady  flow  in  an 
infinitely  long  cylinder. 

At  closer  analysis,  the  z- independent  force  term  in  eq.  (2d)  can  only  be 
balanced  by  a  purely  axial  deviation  velocity.  It  is  consistent  with  the  lin¬ 
earized  equations  to  assume  a  solution  in  the  form 


and  moreover. 


yd  =  (0.  o,  vz),  pd  =  0 


v2  =  vz(r,<t>)  =  2e[f(r)cos<j>  +  g(r)sin<f] 


(4) 

(5) 


Substituting  (4), (5)  into  the  linearized  equations  provides 


f"  +  b  f'“  hr  f  ~  Re  g  =  -  Re  r 
r  r 1 

(6a) 

g"  +^9'  -  719  +  Re  f  =  0 

(6b) 

f=0,  g-0  at  r=l 

(6c) 

f,  g  finite  at  r=0 

(66) 

The  primes  denote  d/dr.  For  Re+0,  the  solution  of  these  equations  can  be 
found  in  the  form  of  series 


f-!r<r 


r3)  - 


Re3 

9216 


(7r  -  12r3  +  6r$  -  r7)  +  0(Re5) 


(7a) 
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With  higher  terms  included,  these  series  converge  for  Re  <  12.  In  the  limit 
Re  ♦  ®,  one  obtains 


f  +  0,  g  ♦  r  '  as  Re  +  ».  (8) 


Owing  to  the  loss  of  the  highest  derivatives,  however,  this  solution  cannot 
satisfy  the  boundary  conditions  (6c)  and  is  valid  only  outside  the  thin  bound¬ 
ary  layers  near  the  wall  at  r=l.  Even  without  any  knowledge  of  the  solution 
in  the  intermediate  range,  the  different  character  of  the  basic  flow  at  low 
and  high  Reynolds  numbers  is  evident.  At  low  Re,  the  component  f  in  the  z,x- 
plar.a  <j>=0  is  dominating.  At  high  Re,  f  Is  negligible  in  the  core  of  the 
cylinder  while  g  in  the  z,y-plane  <j»=90°  is  dominating. 

In  earlier  work  (10),  we  have  applied  a  spectral  collocation  method  for 
numerical  solution  of  eqs.  (6).  Here,  we  derive  an  analytical  solution  by 
introducing  the  complex  function  F»g+1f.  Eqs.  (6)  can  then  be  written  in  the 
form 

r2F"  +  rF'  -  (1  +  IRe  r2)F  =  -  iRe  r*  (9a) 

F  =  0  at  r  =  1  (9b) 

F  finite  at  r  =  0  (9c) 

A  particular  solution  of  the  Inhomogeneous  equation  (9a)  Is  F0=r,  whereas  the 
homogeneous  part  of  (9a)  is  the  equation  for  the  modified  Bessel  functions 
I^qr)  and  Kj(qr)  of  the  complex  argument  qr  with  q  =  /R e/2  (1  +  1)r.  For 
(9c),  K,  (qr)  cannot  contribute  to  the  solution.  Finally,  (9b)  provides 

F(r)  =  g  +  if  =  r  -  Ii(qr)/Ii(q).  (10) 

Expressing  the  solution  in  terms  of  Kelvin  functions  of  real  argument  is  of 
little  advantage  for  the  numerical  evaluation.  The  solution  is  valid  for 
arbitrary  Re  but  may  be  unstable  as  Re  exceeds  some  critical  value.  It  Is 
straightforward  to  derive  the  approximations  (7)  from  the  ascending  series  for 
Ij  (and  to  explain  the  convergence  problem  for  larger  Re).  The  asymptotic 
expansion  for  large  arguments  provides 

F  ~  r  -  /r  e^1"'1*.  (11) 

This  expression  agrees  to  within  1$  with  (8)  provided  that  r  <  1  -  6.  The 
boundary  layer  thickness  6  can  be  obtained  from  the  transcendental  equation 

6  =  /27Re  [4.605  -  |  in  (1  -  «)],  (12) 

e.g.,  6=0.223  for  Re=10Q0.  The  characteristic  changes  in  the  flew  structure 
with  increasing  Re,  in  particular  the  shift  of  the  velocity  maximum  from 
at  Re=2  to  4>=90°  at  Re=209  are  shown  in  Fig.  2. 
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3-  .  ,R?d.1£1,  distribution  of  the  axial  velocity  (In  fps)  at  2-0  for 

Re-14.9  ano  Re=45.7.  The  symbols  show  the  solution  to  the  full  Navier-Stokes 
equations* 


Fig.  3  compares  the  dimensional  velocity  distributions  obtained  from  (5), 
(10)  with  computational  results  for  the  center  cross-sectiori  (z=0)  of  the 
cylinder.*  The  agreement  for  Re=14.9  is  considered  representative  for  the 
range  of  lower  Reynolds  numbers.  The  numerical  simulation  provides  very  small 
components  vr,  v  and  hence  verifies  our  estimates.  At  the  higher  value 
Re=45.7,  a  systematic  deviation  between  the  two  results  seems,  to  be  due  to  a 
superposed  cellular  motion  that  is  not  yet  incorporated  into  our  analysis.  I  . 
is  encouraging,  however,  that  the  simple  theory  of  the  basic  flow  yields 
results  in  essential  agreement  with  the  computational  solution  of  the  full 
Navier  Stokes  equations  for  a  finite-length  cylinder. 

4.  MOMENTS.  With  the  deviation  velocity  y^  =  (0,0, wav,)  and  v2  given, 
the  moments  on  a  finite-length  section  of  the  ynnder  can  be'calculated.  We 
consider  a  control  volume  R  (surface  S)  formed  by  the  solid  cylindrical  wall 
and  liquid  surfaces  at  both  ends.  Conservation  of  angular  momentum  requires 


H*i(r,y=  |r  ;;j(r  X  Vd)pdR  +  JJJ  (r  «  (2a  „  V  .)JodR 
~  at  R  -  -a  r  ~  ~a 

+  (n *  yp^d  ■  2)dS  +  fJ  (r  *  ys)p(vd  •  n)ds 


(13) 


where  n  is  the  outer  unit  normal.  On  the  left-hand  side,  M  is  the  resultant 
torque~on  the  control  volume.  The  second  term  accounts  for  the  moments  due  to 
the  shear  force  Fd  and  vanishes  owing  to  the  solid  sidewall  and  cancellation 
of  the  contributions  from  both  ends**.  On  the  right-hand  side,  the  first  term 
vanishes  for  steady  Vj.  The  second  term  originates  from  Coriolis  forces  in  the 
nutating  system.  The  third  term  vanishes  since  Vd  has  only  an  axial  compo¬ 
nent.  The  last  term  then  provides  the  net  rate  of  angular  momentum  flux 
through  the  control  surface. 

Substitution  of  Vj  leads  to  the  following  expressions  for  the  cartesian 
components  of  M: 

M  =  m  (2oasine)(u>a)  m  ,  m  =  -  J*  r2fdr  (Ida) 

A  Xr  A  A  Q 

M  =  m  (2nasine)(«a)  m  ,  m  -  -  J  r2gdr  (14b) 

y  y  y  o 

M  =  m  (2nasine)2  m  ,  m  =  f1  r2fdr  =  -m  (14c) 

Z  Z  Z  /.  X 


where  m^  is  the  liquid  mass  in  the  cylinder.  In  this  form,  the  components 
Mx,  M  represent  the  net  rate  of  angular  momentum  flux  through  the  liquid  end- 
walls,  whereas  the  roll  moment  Mz  is  solely  due  to  Coriolis  forces.  A  close 


*The  data  were  kindly  provided  by  Or.  H.  Vaughn,  Sandia  National  Laboratories. 

♦♦Improper  account  of  the  sidewall  conditions  introduced  an  incorrect  factor 
of  two  in  earlier  results  for  Mx,  My  (10). 
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relation  between  roll  moment  Mz  and  yaw  moment  Mx  has  also  been  found  by 
Murphy  (111  for  the  range  of  high  Reynolds  numbers. 

A  different  interpretation  can  be  derived  using  the  differential  equation 
(9a),  integrating  by  parts,  applying  (9b),  and  separating  real  and  imaginary 
part: 

mz  =  "mx  =  /*  r2fdr  =  "  U5a) 

my  =  -  j'  r*qdr  =  -  (15b) 


In  this  form,  the  moments  are  directly  related  to  the  shear  forces  at  the 
cylindrical  sidewall,  r=l.  Since  f'(l)<0,  g'(l)<0,  the  roll  moment  M2  is 
always  positive  (even  for  n<0),  while  Mx  is  negative  for  n<0  and  changes  sign 
with  q.  For  small  Re,  the  series  (7)  provide  the  approximations 


Re 
3  96’ 


my 


Re  z 
1536 


(16) 


that  can  be  used  for  quick  estimates  up  to  Re<10.  The  linear  increase  of  m2 
and  M2  with  Re  is  consistent  with  the  experimental  data.  From  the  analytical 
solution  (10),  we  obtain 


F'(l)  =  9'd)  +  1f'(l)  =  2  -  ql^qj/l^q).  (17) 


Substitution  into  (15)  provides  the  variation  of  m2,  m  with  the  Reynolds 
number  shown  in  Fig.  4.  The  coefficient  m2  assumes  a  pronounced  maximum  at 


rig.  4.  The  -nondimensional  coef¬ 
ficients  m2,  my  in  eq.  (14)  versus 
the  Reynolds  number.  Re. 


Fig.  5.  Comparison  of  the  theoreti¬ 
cal  result  for  mz  with  experimental 
data  [5].  The  straight  line  shows 
the  asymptotic  law  m7sRe/96. 
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Re« 19.  The  coefficient  my  1S  negative  and  reaches  an  asymptotic  value  of 
my  *  -  1/4  as  Re  +  <».  Hence,  for  n>0,  My  reduces  the  pitch  moment  due  to 
tne  solid  body  rotation.  This  result  is  consistent  with  computations  [91.  A 
comparison  of  theoretical  and  experimental  [5]  results  for  m^  is  given  in  Fig. 
5  on  a  double  logarithmic  scale.  The  initial  spin  rate  u>=4000  rpm  was  used 
for  reducing  the  experimental  data.  For  Re<10,  the  experimental  points  match 
the  analytical  result  as  well  as  the  asymptotic  law  m2  *  Re/96.  The 
occurrence  of  a  maximum  of  m2  is  found  to  be  a  property  of  the  basic  flow. 
Only  the  systematic  deviation  for  higher  Reynolds  numbers  Re<200  may  be 
attributed  to  a  cellular  motion.  The  two  data  po.its  for  Re>103  probably 
indicate  turbulent  flow. 

The  basic  flow,  hence,  can  be  considered  a  first  but  essential  step 
toward  understanding  and  predicting  the  gross  features  of  the  fluid  morion  in 
a  spinning  and  nutating  cylinder.  Some  observations,  however,  such  as  the 
virtual  independence  of  the  despin  moment  on  the  spin  rate  require  further 
analysis,  especially  of  the  end  effects.  The  occurrence  of  a  cellular  motion 
may  be  due  to  hydrodynamic  instability  of  the  basic  flow. 

5.  STABILITY  ANALYSIS.  The  stability  analysis  is  currently  conducted 
and  only  a  brief  outline  is  given  here.  We  superpose  to  the  steady  flow 
vn=(o,r,vz),  pn  disturbances  v'=(utv,w),  p  sufficiently  small  for  lineariza¬ 
tion.  Substitution  into  eqs.~(2)  and  neglect  of  products  between  disturbances 
and  terms  of  order  0(e2)  provides  the  following  stability  equations: 


Three  groups  of  terms  have  been  separated  by  braces  in  eqs.  (18a-c).  The 
first  group,  if  set  to  zero,  represents  the  equations  for  inviscid  inertial 
modes  ~exp(im$  +  iaz  +  st),  where  m  is  the  (integer)  azimuthal,  o  the  axial 
wavenumber,  and  s  =  sr  +  is-j  provides  the  amplification  rate  sr  (=0)  and  fre¬ 
quency  si.  Usually,  an  equation  for  the  pressure  is  used  for  obtaining  the 
analytical  solution.  We  have  derived  an  alternative  system  In  terms  of  u,v 
and  applied  the  spectral  method  to  be  used  for  more  general  cases  in  order  to 
check  the  numerical  results  against  the  exact  values. 
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The  second  group  of  terms  multiplied  with  1/Re  represents  the  viscous 
correction  to  the  inertial  modes.  Ey  eliminating  w  and  p,  a  system  of 
ordinary  differential  equucions  for  u  and  v  has  been  derived.  Due  to  the 
higher  order  of  this  system,  all  boundary  conditions  can  be  satisfied.  Pro¬ 
grams  have  been  developed  for  calculating  spectra  of  (complex)  eigenvalues, 
for  tracing  single  eigenvalues  as  function  of  Re,  o,  and  for  obtaining  the 
eigenfunctions.  At  high  Re,  the  results  follow  the  trends  predicted  by 
asymptotic  theories.  Our  analysis,  however,  also  covers  the  range  of  low 
Reynolds  numbers,  where  the  inertial  modes  suffer  rapid  decay  (sr<0). 

The  most  interesting  aspect  of  the  stability  equations  (18)  is  the  third 
group  of  terms.  The  coefficients  in  this  group,  v2,  tr,  and  are  (1)  of 
order  0(e)  and  (2)  periodic  in  The  periodicity  in  <t>  leads  to  a  coupling  of 
the  mode  equations  for  m  and  m+1,  and  may  cause  primary  resonance  between 
inertial  modes.  In  view  of  viscous  damping,  this  resonance  is  likely  to  occur 
as  e  exceeds  a  critical  value  that  decreases  as  Re  increases.  The  analysis  of 
this  parametric  instability  is  currently  in  preparation. 
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Abstract.  In  large  linear  programing  problems,  rounding  off  type  errors  can 
cause  problems,  these  errors  stem  from  the  computational  set  which  is  used  by 
a  computer.  Consider  c>  0,  sets  F  and  R  where  R  is  the  set  of  rational  numbers 
and  FcR,  there  exists  a  ,  p  ,  7  ,  and  5  e  F  with  a  G,  0  gt  0  and  7  ;£  -  5 
such  that  whenever  a..  p  <  t  or  7  +  5<f  ,  then 

a  •  0  =0 

7  +  5  =0. 

It  is  shown  that  this  set  is  commutative,  contains  additive  and  multiplicative 
inverses,  is  nonassociatiye,  nondistributive  with  zero  divisors. 

Two  other  sets  which  are  quadratic  extensions  of  the  field  r  ,  Cr  and  Cj 
that  are  nonassociative  algebras  with  zero  divisors  are  developed  with  an  ex¬ 
ample  of  how  they  can  be  used  to  solve  a  differential  equation  of  the  form 

dSi 

P  - -  =EdpKp 

dt 

where  P,  0p  and  Xp  are  vectors  of  any  dimension.  Cr  is  used  to  solve  the  case 
where  JpKp  =  KpJp  and  Cj  the  more  general  noncommutative  case.  Finally,  the 
field  properties  of  F,  cr  and  Cj  are  compared  with  some  implications  for 
software  as  a  result  of  the  loss  of  field  properties. 

1 .  The  concept  of  a  computational  set  F  which  permits  round-off  errors.  FCR 
where  R  is  the  set  of  rational  numbers.  The  aritlmetic  on  F,  for  .lot  neces¬ 
sarily  zero  elements  a,/3, 7,5  ,  X  and  u>  with  7#  -  5  and  (  >  0,  is 

(  X  «  F,  if  lXl>  e 

(1)  a  +  0  =  / 

(  0  c  F,  if  [XJ<  e 


!W  e  F,  if  |«|  >  e 
OeF,  if  l«|  <  e 

An  additional  property  of  this  set  is  the  existence  of  a  whole  number  N  such 
that  N  >  N  +  1. 

As  a  consequence  of  the  definition  of  the  set  arithmetic  as  well  as  the 
properties  of  the  rational  numbers,  the  set  is  commutative  and  closed  under 
the  operations  of  addition  (+}  and  multiplication  (•).  The  set  also  contains 


rttVieas  PAGE 
IS  BLANK 


895 


multiplicative  and  additive  inverses.  However,  not  all  elements  have  multi¬ 
plicative  inverses.  For  example,  the  number  3  does  not  have  an  inverse  because 
the  reciprocal  of  3  is  some  truncated  version  of  1/3.  For  this  example  let  us 
consider  1/3  =  .33,  then  3  x  .33  =  .99^1. 

Additionally,  as  shown  by  the  following  two  examples  (3)  and  (4)  additive 
inverses  are  not  unique.  Let  a  =  .011,  /3  =  -.012,  5  =  -.013  anti  *  =  .01 

(3)  a+/3  =  -.001 

=  0 

because  j -.001 |  <  e  ,  and 

(4)  q+5  =0 

for  the  same  reason.  Thus  additive  inverses  exist  but  are  not  unique.  Because 
of  this  characteristic,  the  associative  property  for  the  set  F  does  not  always 
prevail . 

The  set  F  has  zero  divisors.  Consider  a  ,  ^  and  t  defined  as  above,  then 

(5)  a.0  =  -0.000132. 

Since  j-.000132|<t  ,a*/5  *  0.  Moreover,  for  a,0  and  t  defined  as  above 
and  5  =  1000 

(6)  (  a«  /3  )•  6  a*  (  /3 *  S  ) 
and 

(7)  «'(  a+0  )  4  S-fi 

Thus  F  is  nonassociative  and  nondistributive. 

A  consequence  of  the  properties  of  F  is  that  software  using  assumed 
properties  can  give  misleading  results.  For  example  in  FORTRAN  although  it  is 
often  assumed  to  be  correct,  statements  (8)  and  (9)  are  not  always  the  same, 

(8)  SUMX  *  SUMX  +  C*X 
and 

(9)  S'JMX  =  SUMX  +  X 

SUMX  =  C*SUMX 
where  C  is  a  constant. 

2.  The  concept  of  a  quadratic  extension  T  of  a  field  K  of  characteristic 
*2.  r  is  a  set  with  a  product  xy  and  sum  x  +  y  defined  on  it  with  a  subset_ 

X  =  K  as  well^  as  an  automorphism  Z  -*Z.  For  each  Z€T  there  is  a  T  *  Z  +  Z 
and  an  N  =  ZZ,  for  T,  N  6X.  For  each  Z6T,  we  have 

(10)  Z2  -  TZ  +  N  =  0 
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It  can  be  assumed  that  r-^X.  In  this  case  there  exists  at  least  one  element 
v  6X.  These  elements  constitute  the  set  V.  It  can  be  shown  that 


(11)  r  =  x(±)V. 

The  image  of  the  automorphism  Z  is  called  the  conjugate  of  Z.  T  is  its 
trace,  N  the  norm,  X  is  the  set  of  invariant  elements  and  V  the  set  of  skew- 
conjugate  elements. 

Considerable  work  has  been  done  on  sets  r  .  In  [1]  r  is  characterized 
for  the  real,  complex,  quaternion  and  Cayley  number  systems.  In  L2J  ris 
generalized  to  a  field  K  where  there  is  an  element  i  such  that  i 2  -  /3 i  -  a  =  o. 
The  concept  is  then  extended  to  the  case  where  K  is  a  commutative  ring  with 
unit  that  admits  ari  invclutoria!  automorphism  in  [3]  and  in  [41  the  geometry 
of  the  Z-plane  of  a  quadratic  extension  r  of  a  field  K  is  discussed.  In  [5]  and 
[6]  there  are  examples  of  when  T  is  a  nonassociative  algebra. 

Essentially  quadratic  extensions  belong  to  a  class  of  algebras  commonly 
known  as  Clifford  numbers.  Van  der  Waerden  in  [7]  discusses  a  class  of  these 
numbers  in  his  section  on  hypercomplex  numbers. 

3 .  The  concept  of  the  commutative  algebra  C,i  oyer  a  field  K  of  characteris¬ 
tics  •$.  2.  C.i  is  an  algebra  of  order  J>T  where  1  e3,.. . ,e^  is  a  basis  for 

"Cjl!  IrPthe  case  of  J  =  1 ,  Cj  =  K.  Using  the  operations  defined  on  K,  Cj  has 
the  following  sum  and  products  defined  on  it  for  all  a,  b«Cj  and  «  ,ai ,  b-j«K 

(12)  a  +  b  =  2  (ai  +  bj ) e-j , 

(13)  a  =  a2  a^, 

Id  U 

(14)  ab  =  (a]b]  -  2  a-jbj)l  +  2  (ajb^  +  a-jbiJe-j. 

i  =  2  i  =  2 

_  0 

(15)  The  automorpnism  a*~*a  =  ajl  -  2  a-j e-j . 

i  =  2 

The  unit  or  I  in  r  is 

J 

(16)  1=1+  2  oei. 

i  =  2 

The  trace  T  and  norm  N  are 


(17)  Ta  =  a  +  a  and  (18)  Na  =  aa. 

Ta  and  Na  €  K  and  ab  =  ba  for  all  a,b  6  Cj.  When  the  characteristic  of  K 
is  0,  K  is  real.  The  set  Cj  has  inverses 

(19)  a-’  4a 
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Cj  has  some  interesting  properties.  For  n  =  2,  Cj  ^  4>  c  <£  ,  the 

complex  numbers,  for  appropriate  restrictions  on  K,  For  n>2,  Cj  is  non- 
associative,  nonalternative  and  contains  zero  divisors.  Moreover. 

(20)  Nab  *  NaNb 

(21)  Na2  *  (Na)2 

in  general.  When  K  =  4>  c  (C  which  contains  at  least  one  element  v  =  -v, 

(19)  does  not  always  hoTd  true.  For  K  9  (£  ,  (10)  holds.  Hence,  this  class 
c  algebras  Cj  are  quadratic  extensions  of  the  field  K. 

4.  An  application  of  the  class  of  algebras  C,i  to  the  physical  sciences.  In 
the  physical  sciences  such  equations  as 


(22)  P  =  dSi=£JPKp 


are  frequently  encountered  where  P,  Jp  and  Xp  can  be  vectors  of  any  dimension 
n  =  l,2,...,nf.  If  P,Jp,Xp  Cj  with  0  =  n,  then  (22)  has  the  representation. 


2  (jpi  xp]  -  2  jPi  xPi)  =  Pj 

P  =  1  1-2 


and  for  N, 


2  2  ( j  pi  Xp-j  +  jpi  Xpi )  -  P-j 

P  =  1  i  =  2 


Np  =  ?  P-j^  =  p  ^  ^i  jpi  xPi 


=  2  (2  J pi  xpi  +  2  2  jp^  xpi  Xj^) 

Pi  k^p  i 


2  2  (Pi  jpixpi  -  PiOpXpi )  =  0 

(25)  P  i=2 


2  2  2, 

and  '  p  k*p  i=2  (o Pi xpu ki xki  -  JPiXpiJk-lXkl )  =  C 
where  p,  j ,  x  tK. 

(23),  (24)  and  (25)  hold  regardless  of  whether  K  is  the  set  of  real  num¬ 
bers  or  the  set  of  complex  numbers.  In  this  example  it  should  be  remembered 
that  Cj  is  commutative.  The  noncommutative  case  will  be  treated  in  the 
next  two  sections.  The  last  section  will  address  the  case  that  Op  is  a 
matrix  or  linear  transformation. 

5 .  The  concept  of  the  noncommutative  algebra  Cn  over  a  field  K  of  charac¬ 
teristic  2.  '  Cn  is  an  algebra  of  order  n  >  1  defined  as  in'  section  i  with 
the  one  modification. 
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,  m 

ab  =  (a^b-j  -  £  a,b^)l  +  (a-j  b2  +  a2bl  +  a3b4  +  a4b3)e2 

i  =  2 


(14')  \  +  (a]b3  -  a£b4  +  a3bi  -  a^Jea 


1+  £  (a]bi  +  a£b3  -  a3b2  +  aib^ei. 

N  i  =  4 

Cn  has  all  the  properties  discussed  in  section  2  with  the  exceptions  that 
Cn  is  no  longer  generally  commutative  and  for  n  =  4  there  exists  a  Cn  =ss Q, 
set  of  quaternions. 

6 .  An  application  of  the  class  of  algebras  Cn  to  the  physical  sciences. 
In  tne  case  that  the  elements  P,  Jp  and  Xp "are  noncommutative,  we  have  as 
a  solution  to  (22) 

n 

( E  (Jpixpi  -  I  jpi  xPi)  =  P] 

P  i  =  2 


(23') 


1  (lPixP2  +  JP2XP1  +  J  P3XP4  +  J  P4XP3 )  =  p2 
P 

E  (J  pi  xp3  -  jp2xP4  +  JP3xPl  "  -3 P4XP2 )  =  p3 
P 

n 

E  P2xP3  *  3 P3XP2  +  E  (j PI xPi  +  Jpixpi)]  =  Pi 
P  i  =  4 

\ 

Nn  is  again  yiven  by  (24)  or  is  invariant  and  (25)  is  given  below  by  (25') 

(  £  £  (j PI XP1  Jk2xk2  -  J P2xP2-3 kl xkl  -  3p3xP3Jk4xk4  -  3P4xP43k3xk3) 

P  k*P 

Z  Z  (jpixPlJk3x<3  +  3 P2xP2-k4xx4  -  3>3xP33klxkl  +  3p4xP43k2xk2) 

P  k*P 

n 

E  z  [(3p2xP23k3xk3  +  3>3xP33k2xk2  +  £  UpixPl3kixki 

P  k*p  i  =  4 


(25 


■ 


-  J  Pi  xPi  J  kl  xkl  )3  =  0 

E  (p13P2xP2  "  P 23  PI x PI  -  P3JP4XP4  -  P43P3XP3)  =  0 
P 

E  (p13P3xP3  +  P23P4XP4  ~  P3J*  PI  XP1  +  p43p2xP2)  *  0 
P 

n 

E  [“p23 P3xP3  +  P3JP2XP2  +  Z  (p13PixPi  -  pi3PlxPl)]  =  0 
P  i  =  4 
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Moreover,  it  can  be  shown  that  the  equations  given  by  (23),  (23'),  (24), 

5)  and  (25')  hold  when  K  is  an  associative  ring  with  unit.  In  the  event 
that  K  is  the  real  numbers{24)^  0. 

7 .  An  application  of  C.i  and  Cn  in  the  case  that  the  dp's  are  linear  transforma¬ 
tions^  For  Jp  a  linear  transformation  and'  P,xp  Cj  the  rot  lowing  representations 
hold  for  Np. 


with 


(26) 

Si  =  x 

t 

i 

pi  =  z  ^  E  pi j  pki  xPi 
n 

(27) 

S2  =  £ 

P 

£  [?i  I  JpkixPi  -  pk  E  J  PI  i  xPi  3 

k=2  i  i 

(28) 

s3=  X 

X  'x  jpkixpi)2 
k  1 

(29) 

S4  =  X 
p 

X  X  [(  X jpki xPi ) (  X j 1 ri xPi )3 
1*P  k  1  i 

n 

(30) 

S5  =  X 
p 

£  £  [(£  Jpliv-Pi  )(£ jqkixqi )  - 

q*P  k=2  i  1 

(31) 

Si  =  S3 

+  S4  and 

(32) 

S2  =  S5 

=  0 

For  Jp  a  linear  transformation  and  P,  Xp  Cn  the  following  representations  hold 
for  Np. 

(33)  Sg  =  £  £  [(£  j pi i xpi ) (2  jq2i xqi )  *  (Z  jP2ix?i)(z  Jqlixqi) 

P  QjrP  i  i  i  1 

■  (£  JP3ixPi)(x  Jq4ix^)  *  (£  J?4ixPi)(E  Jq3iXqi)j 

i  i  i  i 

(34)  S7  -  Z  X  [(X  ipiixpi  )(Z  jq3ixqi)  +  (X  jp2ixPi  )(£  Jq4ixqi ) 

P  q*P  i  i  i  i 

-  (£  jp3ixPi)(2  Jqlixqi)  +  (E  JP4ixPi)(z  J q2i xqi )I* 

i  i  i  i 

(35)  Sg  =  X  X  [-(X  jp2iXpi)(E  Jq3ixqi)  +  (z  3P3i xPi ) C'E  Jq:!Xqi) 

P  q^P  i  1  i  i 

n 

+  X  [(z  jpiixPi)(x  Jqki  xqi )  -  (£  1p:<ix»i )  (z  jqlixq’)ll 
k=4  i  i  i  i 

(36)  Sg  =  x  [Piz  jp2ixPi  -  P2X  JP1ixPi  -  P3E  lP4ixPi  -  p*£  0P2ixPi3 

Pi  i  i  i 
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(37)  Sjo  =  l  [P]Z  Jp3iXpi  +  P2r  JP4ixPi  -  P3E  JplixPi  +  p4z  JP2ixPi3 

Pi  i  i  i 

n 

(38)  Su  =  2  [-P2z  Jp3ixPi  +  p$  Jp2ixPi  +  (pU  J>kixPi  -  piE  JPlixPi)] 

Pi  i  k=4  i  i 

with  (31)  as  far  the  commutative  case  and 

(39)  Sg  =  Sj  =  Sg  =  Sg  =  Sip  =  Sn  =  0. 

Thus  it  he.'  been  shown  that  Cj  and  Cn  can  be  used  to  solve  vector  equations. 

8.  A  comparison  of  the  fiald  properties  of  F,  C.i  and  Cn.  As  shown  below  in 
Table  1,  P  does  not  have  all  the  properties  that  Cj  and  Cr  have  with  respect  to 
addition  and  multiplication  inverses.  It  is  interesting  to  note  that  there  is 
not  much  interest  in  using  sets  such  as  Cj  and  Cr.  Howeyer,  we  use  a  set  such 

F  CJ  cN 

+  •  +  •  +  • 

yes  yes  yes  .  yes  yes  yes 

no  no  yes  no  ,  yes  no 

yes  yes  yes  yes  ,  yes  no 

yes  yes  yes  yes  ,  yes  yes 

not  no  yes  yes  ,  yes  yes 

unique 

no  no  nc 


Table  1 

as  F  in  our  computers  and  it  has  clearly  fewer  properties  than  Cj.  Perhaps  this 
will  change  with  time  when  people  become  adjusted  to  the  realization  that  the 
number  sets  which  we  use  may  not  contain  all  the  features  that  one  might  desire. 


Closure 
Associativity 
Commutativity 
Identi ty 
Inverse 

Distributive 


.VJT.v.-.Wiiit;  Ulauittoo  UO *U1Mj, 


%iJik  : 


Bibliography 

1.  Curtis,  C»  W.,  "The  four  and  eight  square  problem  and  division  algebras," 
Studies  In  Modern  Algebra,  vol.  2,  Mathematical  Association  of  America,  1963. 

2.  Oe  Cisco,  J.,  "Introduction  to  the  theory  of  a  quadratic  extension  of  a 
field  K,"  Universita  e  Politecnico  di  Torino  Rendiconti  del  Seminario  Matematico, 
vol.  17,  1957/58,  pp.  223-251. 

3.  ,  "Some  theorems  concerning  commutative  rings  with  unit  which 
admit  invoTutorial  automorphisms."  Reale  Accademia  della  Scienzedi  Torino. 

Atti  Classe  di  Scienze,  Fisiche,  Mathematiche  e  Natural! ,  vol.  92.  1957/58, 

pp,  '2?r-24?. - 

4.  ,  "The  geometry  of  the  Z-plane  based  on  a  quadratic  extension 
of  a  field  k,"  Universita  e  Politecnico  di  Torino  Rendiconti  del  Seminario 
Matematico,  vol.  18,  1 958/59,  pp.  91-119. 

5.  Jacobson,  N. ,  "Structure  and  representations  of  Jordan  algebras," 

American  Mathematical  Society  Colloquium  Publications,  vol.  39,  American 
Mathematical  Society,  1968. 

6.  Kleinfeld,  E. ,  "A  characterization  of  the  Cayley  numbers,"  Studies  In 
Modern  Algebra,  vol.  2,  Mathematical  Association  of  America,  vol.  2,  1963. 

7.  van  der  Waerden,  B.  L, ,  Modern  Algebra,  vol.  1,  New  York,  Frederick 
Ungar  Publishing  Co.,  1953. 


902 


■  CALCULATION  OF  LOWER  CONFIDENCE  BOUNDS 
ON  SYSTEM  RELIABILITY 

Joseph  V.  Michalowicz 
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ABSTRACT,  A  general  methodology,  based  on  algorithms  developed  by  the 
Ad-Hoc  Methodology  Working  Group  on  Nuclear  Weapons  Reliability  Assessment,  is 
described  for  evaluating  90%  lower  confidence  bounds  on  system  reliability  for 
configurations  of  series/parallel  circuits.  General  configurations  of  non- 
repeated  and  repeated  components  are  examined  and  a  method  for  uripooling  data 
is  discussed.  A  technique  is  derived  for  representing  "m  out  of  n"  decision 
logic  gates.  The  methodology  is  applied  to  an  example  of  the  type  of  a 
sophisticated  weapon  fuzing  system.  Maximum  likelihood  estimates  of  relia¬ 
bility  and  90%  lower  confidence  bounds  are  calculated  for  the  system  and 
critical  components  are  identified. 

1.  INTRODUCTION.  For  critical  and  expensive  weapon  systems,  such  as 
nuclear  projectiles,  highly  reliable  subsystems  are  required  to  produce  a  high 
probability  of  successful  system  performance.  Not  only  must  the  reliability 
of  these  integral  subsystems  be  very  high,  but,  since  often  relatively  few  of 
such  weapon  systems  will  be  used  :o  attack  an  enemy  target,  there  must  also  be 
a  high  degree  of  confidence  that  such  reliability  will  be  achieved.  This 
report  describes  a  general  methodology  for  calculating  maximum-likelihood 
estimates  of  reliability  as  well  as  90-percent  lower  confidence  bounds  on  the 
system  reliability  for  general  systems  representable  ss  configurations  of 
series/parallel  circuits. 

In  testing  these  weapon  systems,  because  of  the  scarcity  and  cost  of 
some  of  the  components,  the  tester  must  be  quite  selective  in  the  number  and 
type  of  subsystems  to  be  included  in  field  tests.  An  important  byproduct  of 
the  methodology  to  be  presented  is  that  it  evinces  those  components  that  are 
critical.  In  that  they  constrain  the  lower  confidence  bounds,  and  those  that 
are  not.  Therefore,  it  would  be  highly  cost-effective  to  schematize  the  sys¬ 
tem  in  the  format  of  this  methodology  before  testing  has  begun,  so  that  the 
test  director  can  effectively  allocate  his  test  resources  to  the  critical 
components. 

The  next  section  discusses  the  methodology  for  calculating  confidence 
bounds  on  circuit  system  reliability  in  a  completely  general  way.  It  is  hoped 
that  this  section  will  serve  as  a  handy  reference  to  the  analyst  who  desires 
to  make  confidence-bound  determinations  for  many  types  of  circuit  systems. 
For  example,  the  methodology  should  be  readily  applicable  to  various  kinds  of 
sensors,  radars,  and  missile  guidance  systems.  In  later  sections,  the  method¬ 
ology  is  applied  to  a  system  of  the  type  of  an  actual  weapon  fuzing  system. 
Based  on  simulated  test  data,  90-percent  lower  confidence  bounds  on  system 
reliability  are  calculated  and  critical  components  are  identified. 


2,1  EVALUATION  OF  CONFIDENCE  BOUNDS,  By  a  90-percent  lower  confidence 
bound  on  system  reliability  is  meant  a  statistic  computed  from  the  test  data 
with  the  property  that  there  is  at  least  a  probability  of  0.90  that  this 
statistic  is  lower  than  the  unknown  system  reliability.  Under  the  assumption 
that  tests  on  a  component  are  binomial ,  that  is,  the  tests  are  independent 
with  constant  failure  probability,  the  90-percent  lower  confidence  bound, 
LCB^q,  on  component  reliability  is  computed  as  follows  when  the  test  data 
indicate  N  tests  with  F  failures; 


lcb90  -  b90(n,f)  -  1  -  p 


(1) 


where  p  satisfies  the  binomial  relationship 

!  (?)  P^l  -  P)N_i  -  0.90 

i-F+1 

or,  equivalently, 

l  (?)  P^l  -  p)^1  -  0.10  . 

i-0  1 

These  formulas  assume  that  N  and  F  are  integers;  in  calculating 
equivalent  components  later,  there  will  be  a  need  for  evaluating  90-percent 
lower  confidence  bounds  when  N  and/or  F  are  not  integral.  In  this  case,  the 
following  linear  interpolation  formula  is  useful: 


B90(N,F)«  (1  -  N^Kl  -  Fd)B9q(Ki,Fi)  +  FpB^CNj.Fj.  +  1)] 


+  -  FD)B90(NI  +  +  FDB90(NI  +  l'FI  +  03 


(2) 


where 


Nj  *  [N] ,  the  integer  part  of  N, 

ND  -  N  -  [N] , 

Ft  ”  IF],  the  integer  part  of  F,  and 
FD  -  F  -  |F]. 


m 
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Tables  (1]  are  available  from  which  binomial  confidence  bounds  can  be 
read  for  N  =  1  to  150.  In  the  calculation  of  equivalent-system  lower 
confidence  bounds  later  in  this  paper,  we  shall  frequently  encounter  very 
large  values  of  N  together  with  very  small  values  of  F.  To  obtain  such  lower 
confidence  bounds,  the  Poisson  approximation  to  the  binomial  is  used  when  N  > 
150  and  F  <  10.  As  long  as  F  is  an  integer,  regardless  of  whether  or  not  N  is 
an  integer,  the  Poisson  estimate  is  given  by 

p90<f)~  2-xg.90^2F  +  2)  * 

2 

where  x~  gg(2F  +  2)  denotes  the  90th  percentile  cf  a  chi-square  distribution 
with  2F  t  2  degrees  of  freedom.  Tables  of  the  chi-square  percentiles  can  be 
found  in  many  statistics  textbooks  (see  [2]).  When  F  is  not  an  integer, 
P9q(F)  may  be  calculated  by  linear  interpolation: 

WF>~  '  FD)P90(Fl)  +  FDP9o(FI  +  0  •  (4> 

In  either  case,  the  SO-percent  lower  confidence  bound  is  then  estimated  from 
the  formula 

P90(F) 

LCB90  ■  B90<S*F>  *  1  -  -T—  •  (5) 

Another  useful  formula  for  calculating  component  lower  confidence  bounds 
arises  from  the  observation  that,  when  F  »  0  in  equation  (1),  we  have 

[b90(n,o)]n  -  (1  -  p)N  -  0.10 

which  leads  to  the  exact  solution 

%<N,0)  -  (0.10) 1/N  .  (6) 

It  should  be  clear  that  all  the  preceding  formulas  can  be  readily 
extended  to  the  computation  of  component  lower  confidence  bounds  at  other  than 
the  90-percent  level. 

The  next  several  sections  describe  techniques  for  calculating  .lower  con¬ 
fidence  bounds  for  general  series  and  parallel  configurations  of  components. 
These  procedures  are  taken  from  those  recommended  by  a  special  Working  Group 
chaired  by  the  Army  Materiel  Systems  Analysis ' Activity  [3]. 

2.2  CALCULATION  OF  CONFIDENCE  BOUNDS  FOR  A  SERIES  SYSTEM  OF 
NONREPEATED  COMPONENTS.  The  simplest  case  is  a  system  whose  configuration 
consists  of  a  series  arrangement  of  independent  components,  as  exemplified  in 
figure  1. 


Test  Data:  Ni  Tests 

Si  Successes 
Fi  Failures 


N2  Tests 
S2  Successes 
F2  Failures 


Nj  Tests 
Sj  Successes 
F;  Failures 


N jy  Tests 
S^  Successes 
F„  Failures 


Figure  1.  Series  system  of  nonrepeated  components. 


Nona  of  the  j\f  components  in  this  series  are  repeated;  that  is,  all  are 
independently  functioning  components  which  appear  only  once  and  have  specific 
test  data  in  terms  of  observed  successes  and  failures.  Note  that  = 
N^  for  all  values  of  i* 


The  lower  confidence  bound  on  the  reliability  of  this  series  is  obtained 
by  reducing  the  combination  to  an  equivalent  component.  This  is  done  by  means 
of  the  Linds trom-Madden  method  [4]  which  calculates  the  maximum-likelihood 
estimate  of  the  system  reliability,  Rg,  by  the  formula 


R 


s 


-  n 

H  = 


(7) 


and  takes  the  equivalent  number  of  tests,  N,  for  the  system  to  be 


N  =  min  . 

1<i  <JV  (®) 

The  equivalent  number  of  successes  and  failures  of  the  system,  S  and  F  respec¬ 
tively,  are  then  given  by 


S  ■  NRg  ,  (9) 

F  -  N(l  -  Rg)  .  (10) 


Thus  the  series  combination  is  now  represented  by  a  single  equivalent 
component  with  S  successes  and  F  failures  out  of  N  tests.  The  90-percent 
lower  confidence  bound  for  the  series  combination  can  now  be  computed  by  the 
methods  of  section  2.1. 


For  example,  consider  the  three  components  in  series  in  figure  2 


«.\  u*  ,v>Ju.^.^.uiJ.i«  r>*  ,«u* 


Component  1 


Component  2 


Component  3 


Figure  2.  Example  of  series  system; 


The  computational  procedure  gives  the  following: 


Step  1.  R 


i  48 
1  x  jo  x  1 


0.96 


Step  2.  N  =*  min(25,  50,  63)  -  25 

Step  3.  S  =  0.96(25)  -  24 
F  -  0.04(25)  -  1 

Step  4.  LCBqn  ■  Bqn(25,  1)  *  0.853  (from  eq  (1)  and  table  lookup) 


2.3  CALCULATION  OF  CONFIDENCE  BOUNDS  FOR  A  PARALLEL  SYSTEM  OF 
NONREPEATED  COMPONENTS .  For  a  system  configured  as  in  figure  3  with 

independent,  nonrepeated  components  in  parallel,  an  equivalent  single 
component  is  again  derived.  The  equivalent  number  of  tests,  N,  is  computed 
from  the  equation  F 


N  = 


1  -  Q* 
O'  -  f 


(ID 


where 


n?- 

i=1  1 


H  Fi  +  1 

q  =  n  ni  + 1 

i=1 


i<(>  { Al 

Nj  =  Number  ot  Tests 
Sj  =  Number  of  Successes 
Fj  =  Number  of  Failures 


Figure  3.  Parallel 
system  of  nonrepeated 
components. 


aRd  the  maximum  likelihood  estimate  of  the  system 
reliability  is  then  given  by: 


1  -  Q 


(12) 


The  equivalent  numbers  of  successes  and  failures  are  then 
derived: 


S  »  NR 


‘s 


NQ 


(13) 

(14) 
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The  90-percent  lower  confidence  bound  for  the  system  reliability  can  now  be 
computed  as  that  for  the  equivalent  single  component  with  F  failures  out  of  N 
tests. 

An  example  of  a  parallel  system  is  given  in  figure  A.  The  computational 
steps  proceed  as  follows; 


Step  1. 


Step  A. 
Step  5. 


0  X  20  X  30  "  ° 


11  x  21  X  31 


0.000838 


Step  3.  N  «  ~_-^q  *  1192.5 


1192.5 


Step  6.  LCB9q  -  B9QC 1192.5,  0)  -  (0.10) 


1/1192.5  _  0#9981 
(from  eq  (6)) 


Component  1 
H-t  ==  10 

h  =  o 

Component  2 
N2  =  20 

~  F2  =  2  ' 

Component  3 
H3  =  30 
“  F3  =  1 


Figure  4.  Example  of  parallel  system. 


2 .A  CONSTRUCTION  OF  AH  EQUIVALENT  COMPONENT  WITS  SPECIFIED  RELIABILITY 
AND  CONFIDENCE  BOUND.  In  reducing  a  complex  combination  of  components  to  an 
equivalent  single  componeut,  a  sequence  of  substitutions  may  be  required.  It 
may  occur,  in  the  techniques  to  be  developed  in  subsequent  sections,  that  some 
reductions  will  calculate  the  maximum-likelihood  estimate  of  reliability  as 
well  as  the  lower  confidence  bound  for  a  subsystem  without  specifying  the 
equivalent  test  ..data.  Therefore,  it  will  be  useful  in  the  sequel  to  have  a 
technique  for  constructing  the  equivalent  test  data  for  a  subsystem  when  given 
only  the  maximum-likelihood  estimate  of  reliability,  R,  and  the  90-percent 
lower  fconf idence  bound,  Bgg. 

The  technique  for  solving  for  the  equivalent  number  of  tests,  N,  and 
failures,  F,  given  R  and  B90,  is  actually  just  the  solution  of  the  following 
two  equations  in  two  unknowns: 

R  “  1  "  £  ' 

®on  *  Bar»(N,F)  . 
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However,  the  second  of  these  equations  cannot  be  solved  explicitly,  so  an 
iterative  approach  to  solution  is  used. 

The  iteration  begins  with  an  initial  estimate  of  N,  denoted  by  N^, 
calculated  from  the  formula 


N 


-  In  0.10 
1  -  In  B90  * 


(15) 


If  the  reliability  estimate  R  is  equal  to  1,  set  N  =  N^  and  F  **  0.  If  not,  an 
estimate  of  F,  denoted  by  F^,  is  obtained  from 


F<  =  (1  -  R)N1 


(16) 


and  the  confidence  bound  BgQ,(Nj,Fj)  is  determined  by  the  techniques  in  section 
2.1.  An  adjustment  factor" given  by 

in  B90(N1fFl) 


t  = 


In  B, 


90 


(17) 


is  used  to  obtain  the  next  estimate  of  N,  denoted  by  Nj: 


N2  =  tN1  .  (18) 

If  the  adjustment  factor  is  near  enough  to  1  (i.e..  It  -  1|  <  0.01),  then  use 
N  =  N2  and  F  *  (1  -  R)^  as  the  equivalent  test  data.  If  not,  N2  is  taken  as 
the  estimate  of  N  and  the  above  process  (eq  (16)  through  (18))  is  repeated 
until  the  adjustment  factor  converges  close  enough  to  1,  resulting  in  the 
equivalent  values  of  N  and  F.  This  procedure  is  illustrated  in  the  next 
section. 

2.5  CALCULATION  OF  CONFIDENCE  BOUNDS  FOR  A  SYSTEM  CONSISTING  OF  ONLY  A 
SINGLE  COMPONENT  REPEATED  IN  ANY  CONFIGURATION .  This  section  describes  the 
methodology  to  be  used  for  calculating  confidence  bounds  for  a  system  or 
subsystem  which  is  a  combination  of  series  and/or  parallel  circuits  composed 
solely  of  repetitions  of  the  same  component.  More  precisely,  the  components, 
although  separate  physical  devices,  are  the  same  in  the  sense  that  they  are  of 
the  same  generic  type  and  are  described  by  the  same  test  data. 

Suppose  the  system  to  be  an;  .yzed  is  a  series/parallel  configura¬ 
tion  consisting  of  repetitions  of  a  component,  C,  characterized  by  test  data 
indicating  F£  failures  in  Nc  tests.  The  maximum-likelihood  estimate  for  the 
reliability  of  the  component,  C,  is  given  by 


(19) 


ye? 


tvttw  mrwiip  crfcv.v«,;nitrviiv  zr 


vkstv  irv  trw  -u  :rv  w  ur-w  :/v_ -*  ■ 


-v  ■'sr.'Vi's  * •% *j-n ■  ="w_v T*-vr 


Analysis  of  the  system  into  itc  series  and  parallel  branches  of  C  components 
gives  rise  to  a  reliability  estimate  for  the  system  which  is  a  function  of  Rc: 


=  f(RJ  • 


(20) 


For  example^,  if  the  configuration  consisted  of  n  components  C  in  series,  f(R  ) 
would  be  K*,  whereas  if  the  configuration  were  n  components  C  in  parallel, 
f(Rc)  woulf  be  1  -  (l  -  Rc)n. 

To  calculate  confidence  bounds  for  the  general  series/parallel 
configuration  of  C  components,  the  methodology  begins  by  evaluating  the  90- 
percent  lower  confidence  bound  for  C: 


LCB 


■  B9o(“e’Fc)  ' 


The  90-percent  lower  confidence  bound  for  the  system,  LCBg,  is  then  calculated 
by  means  of  the  function  in  equation  (20): 


LCBS  =  f(LCBj  . 


(21) 


Therefore,  we  have  obtained  the  maximum-likelihood  estimate  of  reliability  (eq 
(20))  and  the  90-percent  lower  confidence  bound  (eq  (21))  for  the  system. 
Equivalent  test  data  for  the  system  (that  is,  Ng  and  Fg)  can  now  be  calculated 
by  the  method  of  section  2.4. 


For  the  special  case  where  the  configuration  of  the  system  is 
just  a  series  arrangement  of  n  repeats  of  C  and  where  F  is  small  compared  to 
Nc  (that  is,  Fc  <  Nc/10),  two  simple  but  accurate  approximations  for  Ng  and  Fg 
are  available.  Both  of  these  approximations  are  conservative  in  that  they 
tend  to  underestimate  Ng: 


Approximation  i : 

N  •>  — 2- 

s  n 

(22) 

Fs  =  0  ~  rsK 

(23) 

Approximation  2: 

ti  -  Fc 

NS  1  -  R 

(24) 

s 

Fs  -  0  -  rsK  =  Fc 

(25) 

Note  that  the  second  approximation  cannot  be  used  when  Rg  -  1  (or, 
equivalently,  Fc  «  0),  but  in  this  case  the  first  approximation  yields  exactly 
the  same  values  as  the  general  method  in  section  2.4,  since 


from  equation  (15) 


from  equation  (6) 


These  approximations  are  often  useful  in  the  reduction  of  a  complex  system 
with  series  subsystems  to  an  equivalent  system. 

As  an  example,  consider  the  series/parallel  configuration  in 
figure  5,  where  Nc  «  15  and  Fc  *  1.  Computations  proceed  as  follows: 

Step  1.  Rc  =  1  ~  Tc  =  0.93333 

Step  2.  Rg  =  [l  -  ^1  -  Rc)2][l  -  (l  -  Rc)3]  =  0.99526 
Step  3.  LCBc  -  B90(15,  1)  -  0.7643 

Step  4.  LCBg  -  [1  -  (1  -  LCBc)2)[1  -  (l  -  LCB^3]  -  0.93206 

Step  5.  The  ite.ative  method  of  section  2.4  with  R  -  0.99526  and 
Bo0  "  °«93206  then  gives  the  following  table,  where  the 
Bqn(N1 ,F. )  valuer  are  obtained  by  the  interpolation 
formula  (2). 


Iteration  N1  F1  Sc\Vl)  t  Nj~ 

1  32.73  0.155  0.9247  1.112  36.42 

2  36.42  0.173  0.9314  1.011  36.81 
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36.81 


0.174 


0.9320 


0.9997  36.80 


Figure  5.  Serieo/parallel  config¬ 
uration. 


by 


Consequently,  the  equivalent  test  data  for  the  system  ^re  given 


Ns  -  36.80 

F8  -  (1  -  R8)N8  -  0.174  . 

2.6  CALCULATION  OF  CONFIDENCE  BOUNDS  FOR  GENERAL  CONFIGURATIONS — METHOD 
FOR  UNPOOLING  DATA.  The  techniques  discussed  so  far  permit  the  calculation  of 
lower  confidence,  bounds  on  system  reliability  for  series/parallel  systems  of 
Independent,  nonrepeated  components,  as  well  as  for  systems  which  contain 
repeated  component  types,  as  long  as  each  repeated  component  appears  in  only 
one  subsystem.  In  order  to  handle  configurations  in  which  repeated  components 
are  distributed  throughout  several  subsystems  in  combination  with  other 
repeated  or  nonrepeated  components,  a  method  will  be  described  for  unpooling 
the  data  for  repeated  components.  This  method  divides  the  component  test  data 
into  grorns  corresponding  to  the  various  subsystems  in  which  the  component 
appears,  and  then  treats  the  component  as  distinct  and  independent  within  each 
subsystem.  It  has  been  found  that  such  unpooling  schemes  provide  somewhat 
conservative  lower  confidence  bounds  on  reliability. 


The  basic  idea  behind  the  unpooling  method  is  as  follows.  Sup¬ 
pose  C  is  a  component,  with  test  data  indicating  Fc  failures  in  Nc  tests, 
which  occu'8  in  n  subsystems,  where  the  subsystems  are  chosen  to  each  contain 
as  many  appearances  of  C  as  possible  and  still  be  analyzable  by  the  techniques 
of  sections  2.2  through  2.5.  Thus  each  subsystem  either  contains  just  one 
appearance  of  C  or,  if  it  contains  two  or  more  appearances,  that  portion  of 
the  subsystem  can  be  reduced  to  a  configuration  composed  of  repetitions  of  a 
single  equivalent  component.  The  component  C  will  be  relabeled  as  Co, 
...,  Cn,  respectively,  for  each  of  the  n  subsystems  in  which  it  appears.  The 
test  data  for  C  is  then  allocated  over  the  n  subsystems  in  such  a  way  as  to 
keep  the  maximum-likelihood  estimate  of  reliability  for  each  C^,  i  -  1,  2, 
...,  n,  equal  to  that  for  C.  That  is,  the  constraints  on  the  unpooling  are 


n 


N 


C  ' 


for  i  *  1 ,  2,  . . . ,  n  . 


m 

ir.i 
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There  are  many  ways  of  unpooling  which  satisfy  these  constraints. 
The  method  used  here  unpools  according  to  the  following  scheme: 


(1) 

(2) 

(3) 

etc 


Unpool  equally  in  a  series  direction. 

Then  unpcol  equally  in  a  parallel  direction. 
Then  unpool  equally  in  a  series  direction. 


This  sequence  is  best  illustrated  by  an  example,  as  shown  in  figure  6.  In 

this  system  the  component  C  appears  in  four  subsystems  and  has  been  relabeled 

accordingly.  The  first  step  of  the  unpooling  would  allocate  N  /2  and  Fc/2  to 

Cj  and  the  other  Nc/2  and  F  /2  to  the  parallel  combination .  Since  there  are 

two  branches  in  parallel,  tne  second  step  of  the  unpooling  would  divide  in 

half  the  equivalent  test  data  for  the  parallel  combination,  thus  allocating 

Nc/4  and  Fc/4  to  and  the  other  Nc/4'  and  Fc/4  to  the  series  combination 

containing  Co  and  Co.  In  turn  the  third  step  of  the  impooling  allocates  N  /8 
.  c  -  -  -  •  .... 


and  Fc/8  to  each 
appearance  of  C  would 


ot  Co 

id  b 


and  C3.  In  summary,  the  unpooled  test  data  for  each 
e  as  follows. 


Component 

Test 

data 

N 

F 

C1 

V2 

V2 

C2 

V8 

Fc/8 

C3 

V8 

Fc/8 

C4 

V4 

V4 

Total: 

Nc 

Fc 

After  unpooling,  each  of  the  C^'s  is  treated  as  a  separate,  independent  compo¬ 
nent  and  the  techniques  in  sections’  2.2  through  2.5  are  applied,  as  appropri¬ 
ate,  to  each  of  the  subsystems. 


Figure  6.  Example  of  unpooling  scheme 
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2.7  REPRESENTATION  OF  "TWO  OUT  OF  THREE"  DECISION  GATE.  Synthesis  of 
the  techniques  presented  so  far  enables  one  to  calculate  lower  confidence 
bounds  on  reliability  for  any  series /parallel  configuration.  However,  many  of 
the  sophisticated  circuits  of  today  contain  other  configurations,  such  as 
decision  gates,  to  gain  greater  reliability  and  efficiency.  A  straightforward 
procedure  will  be  formulated  to  handle  decision  gates  by  approximate 
equivalent  combinations  of  series  and  parallel  circuits c 

The  methodology  will  be  illustrated  for  a  "two  out  of  three"  decision  gate; 
the  extension  to  general  "k  out  of  a'*  decision  log's  gates  should  be  clear. 

First,  observe  that  for  a  series  combination  of  components  C. , 
C2,  ...»  CR,  with  component  failure  probabilities  Q_  ,  Q  ,  ...,  Q  ,  the 
failure  probability  of  the  combination,  0,  is  given  by  1  2  CK 

c  •  1  -  (1  -  0^1  -  <b2)  ...  (l  -  Qck) 

=Qq  +  Qq  +...+Qq  +  second  and  higher  order  terms  . 

1  2  K 

Mission  reliability  equations  for  modern  weapon  systems  typically  neglect  the 
second  and  higher  order  terms  and  nimply  add  together  failure  probabilities  of 
components  in  series.  On  the  other  hand,  if  Cj,  C,,  were  in  parallel, 

the  failure  probability  for  the  system  would  be,  simply. 


2  =  Qc  •  •  •  Or  • 

1  ^2  K 


For  a  decision  gate  configuration  which  requires  success  in  two 
(or  more)  of  the  three  branches  (with  each  branch  consisting  of  the  same 
component  C)  for  a  YES  vote,  the  probability  of  failure,  Q,  of  the  gate  (i.e., 
a  NO  vote)  is  given  by 


Q  *  probability  that  2  or  3  branches  fail 

“  3Q$  +  $  * 

where  Qc  is  the  failure  probability  of  the  component  C.  In  terms  of  failure 
probability,  the  decision  gate  is,  therefore,  approximately  equivalent  to  the 
series/parallel  combination  shown  in  figure  7,  which  has  a  failure  probability 
given  by 


2=^  +  2c+^+®C  +  f°urth  and  higher  order  terms  . 


Since  the  terms  omitted  by  Introducing  this  approximation  are  two  orders  less 
than  those  already  typically  neglected  in  the  mission  reliability  equation, 
this  series/parallel  combination  should  afford  a  sufficiently  accurate  repre¬ 
sentation  of  the  ‘decision  gate. 


Figure  7,  Decision  gate 
approximate  equivalent  com¬ 
bination. 


3.  CASE  STUDY  -  APPLICATION  OF  THE  METHODOLOGY.  The  methodology 
described  in  the  previous  sections  will  now  be  applied  to  an  example  based  on 
the  actual  fuzing  system  of  a  battlefield  weapon.  The  system  schematic,  shown 
in  figure  8,  is  at  the  same  level  of  sophistication  as  the  fuzing  system. 
However,  for  purposes  of  keeping  this  report  unclassified,  a  few  modifications 
have  been  made  to  the  actual  schematic  and  simulated  component  test  data  is 
used.  Note  the  "two  out  of  three"  decision  gace  equivalent  in  the  upper  right 
hand  part  of  the  system  schematic  in  figure  8.  The  simulated  component  test 
data  is  displayed  in  table  1.  For  some  of  the  components  only  a  reliability 
value,  R,  is  available,  presumably  based  on  a  large  number  of  tests  by  the 
manufacturer;  such  components  are  denoted  by  an  asterisk  in  figure  8. 

The  lower-confidence- bound  computation  for  this  system  will  proceed 
through  two  reductions  of  the  syr^em,  unpooling  into  subsystems  and  calcula¬ 
tion  of  equivalent  components,  and  then  the  calculation  of  the  system  lower 
confidence  bound  itself.  In  the  process,  components  critical  to  the  confi¬ 
dence-bound  assessment  will  be  evinced  and  pertinent  observations  made. 

In  the  first  reduction  many  of  the  series  combinations  which  are  re¬ 
peated  in  a  particular  type  of  configuration  throughout  the  system  schematic 
are  simplified.  The  computations  are  sketched  in  appendix  A.  Note  that  those 
components  which  have  reliability  estimates  only  are  treated  as  having  essen¬ 
tially  an  infinite  number  of  trials;  thus  they  do  not  affect  the  calculation 
of  the  equivalent  component  N  (number  of  trials)  but  only  the  calculation  of 
".he  equivalent  component  R  (reliability).  After  the  first  reduction,  the 
system  schematic  takes  the  form  shown  in  figure  9. 


FIGURE  8.  SYSTEM  SCHEMATIC 


TAILS  1.  SIMULATED  COMPONENT  TEST  DATA 


Coaponant _ Huabar  of  Tr lata _ Nuabat  of  Pollute* 


A 

I 

C 

D 

E 

T 

C 

R 

I 

J 

K 

L 

M 

N 

0 

P 

Q 

t 

s 

T 

U 

V 

u 

X 

T 

z 

a 


5000 

201 

X  -  .995 
X  -  .99* 
X  -  .9997 
192 
192 
401 

X  .9978 
X  -  .97 
X  -  .999 
X  -  .999 
X  *  .999 
573 
573 
573 
570 
572 
250 
328 
384* 

384 

383 

383 

384 

»2 

*01 


8 

T 

8 

c 

{ 

1 

• 

> 

K 

X 


1260 

1260 

384 

382 

382 

381 

381 

399 

371 

375 


V 

V 

1 

p 

0 

t 

4 


-  .992 

-  .998 

-  .9998 

-  .9999 

-  .9955 

-  .9995 

-  .9982 


Pot  *  pttalUl  pair  o£  0  coaponanta 


0 

0 


0 

0 

0 


0 

0 

0 

1 

0 

15 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

3 

0 

0 

0 

0 

0 
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FIGURE  9.  SYSTEM  SCHEMATIC 
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The  primary  function  of  the  second  reduction  Is  to  consolidate  the  long 
series  of  components  at  the  beginning  of  the  schematic  In  figure  9.  After 
these  substitutions  are  made,  the  reduced  schematic  assumes  the  tractable  form 
shown  in  figure  10.  Details  of  the  reduction  procedure  are  given  in  appendix 
B. 


The  reduced  schematic,  divided  into  subsystems  as  shown  in  figure  10, 
can  now  be  treated  by  applying  the  methodology  developed  previously  to  each  of 
the  numbered  subsystems  and  then  determining  the  equivalent  N  and  R  for  the 
overall  series  configuration  of  subsystems.  However,  the  data  must  first  be 
unpooled  for  components  which  appear  in  more  than  one  subsystem.  The  compo¬ 
nents  which  appear  in  the  reduced  schematic,  before  unpooxing,  are  listed  in 
table  2  along  with  their  equivalent  test  data.  The  equivalent  test  data  after 
unpooling  are  shown  in  table  3.  Note  that  those  components  which  appear  in 
more  than  one  subsystem  have  had  an  extra  subscript  appended  to  indicate  those 
repetitions.  (For  example,  refers  to  the  third  distinct  appearance  of  V^, 
in  the  top  branch  of  subsystem  6.) 


The  equivalent  number  of  trials,  N,  and  the  maximum-likelihood  reliabil¬ 
ity  estimate,  R,  are  computed,  subsystem  by  subsystem,  in  appendix  C  and  tabu¬ 
lated  in  table  4.  Since  the  overall  system  configuration  is  now  represented 
as  a  series  combination  of  these,  subsystems,  the  maximum-likelihood  estimate 
of  the  overall  system's  reliability  is  just  the  product  of  the  subsystem  reli¬ 
abilities  (R  -  0.9824),  and  the  equivalent  number  of  trials  is  the  minimum  of 
those  for  the  subsystems  (N  *  165).  This  minimum  number  (indicated  by  an 
asterisk  in  table  4)  corresponds  to  the  critical  subsystem — that  which 
delimits  the  equivalent  number  of  trials.  Note  how  only  a  few  subsystems,  and 
thus  only  a  few  components,  may  determine  the  calculation  of  the  confidence 
bound.  Examination  of  the  critical  subsystem  8  identifies  the  critical 
component  of  the  overall  system  (i.e.,  that  component  for  which  additional 
test  data  could  increase  the  equivalent  number  of  trials  for  the  overall 
system  and  hence  improve  the  resulting  lower  confidence  bound  ),  to  be  the  Z 
component . 


FIGURE  10.  REDUCED  SCHEMATIC 
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Coaponent 

TABUS  2.  EQUIVALENT  TEST  DATA  — 

Nuaber  of  Trial* 

BEFORE  unfooling 

Nuab«r  of  Failure* 

h. 

192 

3.130 

"l 

573 

0 

Q 

570 

1 

R 

572 

0 

S 

250 

15 

T 

328 

1 

V1 

383 

0.997 

Z 

92 

0 

°t 

401 

0.401 

« 

384 

0 

«J 

10,961 

0.680 

®i 

371 

3.313 

Coaponent 

TABLE  3.  EQUIVALENT  TEST  DATA  - 

(Juaber  of  Trial* 

-  AFTER  UNPOOLInG 

Nuaber  of  Failure* 

A2 

192 

3.130 

"11 

143.25 

0 

"l2 

143.25 

0 

"l3 

143.25 

0 

"14 

143.25 

0 

9t 

190 

0.3333 

92 

190 

0.3333 

93 

190 

0.3333 

R 

572 

0 

S1 

83.33 

5 

S2 

83.33 

5 

s3 

83.33 

5 

T1 

109.33 

0.3333 

T2 

109.33 

0.3333 

T3 

109.33 

0.3333 

vn 

95.75 

0.24925 

V12 

95.75 

0.24925 

V13 

47.875 

0.124625 

V14 

47.875 

0.124625 

VH 

95.75 

0.24925 

Z1 

18.4 

0 

Z2 

18.4 

0 

z3 

18..4 

0 

Z4 

18.4 

0 

z5 

18.4 

0 
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TABLE  3.  EQUIVALENT  TEST  DATA  —  AFTER  UNPOOLIHG  (COHT'D) 


Component 


Number  of  Trial* 


Number  of  Failure* 


“tl 

80.2 

0.0802 

a12 

80.2 

0.0802 

°13 

80.2 

0.0802 

80.2 

0.0802 

“15 

80.2 

0.0802 

S1 

192 

0 

«2 

192 

0 

e2 

10,961 

0.680 

®21 

18S.S 

1.6565 

e22 

185.5 

1.6565 

Subsyatem 

TABLE  4.  SUHHARY  OF  SUBSYSTEM 

R 

DATA 

Equivalent 

N 

A2 

0.98370 

192 

c2 

0.999938 

10,961 

1 

0.999991 

2,158 

2 

0.9998891 

1,166 

3 

0.9999884 

2,040 

A 

1 

13,919 

5 

1 

15,989 

6 

0.9989715 

478 

7 

0.9999323 

222 

8 

1 

165* 

9 

1 

737 

10 

0.999999 

2,586 

Syatem  0.9824 

R  -  Maximum-likelihood  eatlmate  of  reliability 

165 

N  •  Number  of  trial* 


The  data  are  now  in  place  to  calculate  the  90-percent  lower  confidence 
bound  on  the  reliability  of  this  example  system.  We  have: 

R  =  0.9824 

N  -  165 

F  -  (l-R)N  *  2,904 

The  interpolation  formula  (2)  and  the  Poisson  estimate  (3)  give  the  90-percent 
lower  confidence  bound: 


Note  that  R  is  a  point  estimate  of  the  reliability  of  the  system,  whereas  the 
lower  confidence  bound  is  a  bound  on  the  unknown  actual  sy<  Tm  reliability, 
not  on  the  point  estimate. 

In  summary,  the  general  methodology  described  in  this  report  has  been 
utilized  to  estimate  the  system  reliability  of  a  practical  weapon  system 
design,  to  obtain  a  90-percent  lower  confidence  bound  on  the  system  relia¬ 
bility,  and  to  determine  those  system  components  which  are  prime  candidates 
for  further  design  tests. 
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APPENDIX  A 


FIRST  REDUCTION 


In  each  replacement  of  a  subsystem,  show  in  figure  8  in  the  body  of  the 
report,  by  an  equivalent  component,  both  the  original  subsystem  and  the  new 
equivalent  component  will  be  depicted.  The  methodology  used  for  the  reduction 
will  be  referred  to  by  the  appropriate  section  in  the  body  of  the  report.  The 
symbols  N,  F,  R,  and  Q  will  be  •  used  throughout  to  denote  number  of  tests, 
number  of  failures,  maximum  likelihood  reliability  estimate,  and  failure 
probability,  respectively. 


Original  subsystem 


(method  of 
sect.  2.5. 
eg  (22)) 


5JE 


jWr 


0.0009 


Equivalent  component 


Original  subsystem 


Cl  -  0.001 
0^  «  o.oot 


nn  = 

573 

N0  «  573 

Np  =  573 

(method 

*■0 

z 

ii 

0 

F0  =  0 

Fp  =  0 

sect.  2 

rn  = 

1 

R0  =  1 

50 

U 

II 

\  “  1 

573 

0 


Original  subsystem 


Equi va lent  component 


Na  *  401  Rf|  ■  0.999  Hg  -  1260  »  1260  B6th°d 

_  sect.  2.2) 


=  382  Nn  =  381 


(method  of 


*  0 


sect.  2.2) 


N0  =  381  Hx  =••  399  Nk  =  371  Nx  =  375 


(method  of  ^  __  ^71 


Fo  =  0 


F1  =  0 


fk  “  0  fa  *  0 


sect*  2* 2)  p  —  q 
ei ' 


*This  warhead  component  is  repeated  in  other  subsystems  but ,  since  it  is 
being  treated  as  having  essentialy  an  infinite  number  of  trials,  it  cannot 
affect  calculation  of  the  equivalent  N  and  so  it  can  be  treated  as  independ¬ 
ent,  affecting  only  iiie  calculation  o*  R. 


A  A-  A  V*  /.  /*  % . 


L,' «  ir.iL-v\LV  -j-.-l.-v  -jvlthuv  jw  l- 


iOTT  CV  £TK  ;jV  T.~V  iZ'V  -T*  ; 
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APPENDIX  B 


SECOND  REDUCTION 


In  each  replacement  of  a  subsystem,  shown  in  figure  9  in  the  body  of  the 
report,  by  an  equivalent  component,  both  the  original  subsystem  and  the  new 
equivalent  component  will  be  depicted.  The  methodology  used  for  the  reduction 
will  be  referred  to  by  the  appropriate  section  in  the  body  of  the  report.  The 
symbols  N,  F,  R,  and  LCB  will  be  used  throughout  to  denote  number  of  tests, 
number  of  failures,  maximum  likelihood  reliability  estimate,  and  90-percent 
lower  confidence  bound,  respectively. 


Original  subsystem 


Equivalent  component 


Rt  -  Rj  -  -  0.998999 

0.9978  0.9991  0.999 


( Method  of 
Met.  2.2) 


H,  «  192 

a2 

-  0.98370 

F.  «  3.130 
"2 


R_  »  0.99215 
e1 


(method  of 
sect.  2.5) 


N_  =  10,961 
e2 

F.  =  0.680 
e2 
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LC;Be^  ==  0.98244  by  the  Poisson  estimate  (eq  (5)) 


1  -  (l  -  Re^2  =  0*999938 
1  -  -  iCBg^2  =  0.999692 


The  iterative  method  of  section  2.4  is  used  to  find  Ue  and  F£  ,  producing  the 
results  in  the  following  table.  2  2 


Iteration 

N1 

F1 

B90^“1  'F1  j 

t 

N2 

1 

7,475 

0.4635 

0.999593 

1.32 

9,878 

2 

9,878 

0.6124 

0.999668 

1.078 

10,648 

3 

10,648 

0.6602 

0.999685 

1'.  023 

10,890 

4 

10.-890 

0. 6752 

0.999690 

1.006 

10,961 

From  these  results, 


N  =  10,961  , 


’•-2  -  ('  -  %K  -  °-68° 


Original  subsystem 


Equivalent  component 


N„  *  371  R„  =  0.99107 
Pi 

pa,  -  0 


RH-  =  0.99107 


APPENDIX  C 


ANALYSIS  OF  SUBSYSTEMS 


The  equivalent  number  of  trials  and  the  maximum-likelihood  reliability 
estimate  will  be  calculated  for  each  of  the  10  subsystems  in  the  reduced 
schematic  in  figure  10  in  the  body  of  the  report.  The  methodology  used  for 
each  subsystem  will  be  referred  to  by  tne  appropriate  section  in  the  body  of 
the  report.  The  symbols  N,  F,  R,  and  LCB  will  be  used  throughout  to  denote 
number  of  tests,  number  of  fallurec,  maximum  likelihood  reliability  estimate, 
and  90-percent  lower  confidence  bound,  respectively. 

SUBSYSTEM  1 

This  subsystem  is  a  series/parallel  configuration  consisting  of  repeti¬ 
tions  of  a  single  series  combination: 


For  this  series, 

N  =  143.25 
R  *  0.99825 

LCB  =  0.98107  (by  interpolation  formula  (2)  in  the  body  of  the 
report 

For  subsystem  1  (using  the  method  of  sect.  2.5),  we  obtain 

Rj  *  [l  -  (1  -  R)2]3[l  -  (1  -  R)3]  =  0.999991  , 


LCBj. 

=  [1  ' 

•  (1  -  LCB) 

Z]3[l  -  (1  -  LCB)3]  =  0. 

998918 

which  leads 

to  the 

results  in 

the  following  tables 

Iteration 

*1 

P1 

b9o(*V'i)  » 

N2 

1 

2127 

0.0191 

0.998905  1.010 

2158 

This  gives  the  final  data  for  subsystem  1 : 

Nx  *  2158  f 
Rj  =  0.999991  , 


ixr-T  »r*r^  r*  J  -  t  . 


';’!3S  YS?i3-: 


For  the  upper  series: 


0- 

1 2 

*2 

Nh  =  143.25 
“12 


"e2  -  190 


Fn  =  0.3333 
y2 


Equivalent  N  =  143.25 
R  =  0.99825 


For  the  lower  series: 


Ns  =  83.33 


95.75 


N„  =  18.4 


F  =  5 
b1 


F„  =  0.24925  F„  =  0 
11  Z1 


N  =  80.2 
°11 


F  =  0.0802 
°11 


Equivalent  N  =  18.4 

R  *  0.93662 


For  subsystem  2  (using  the  method  of  sect.  2.3),  we  obtain 


Q  *  0.00175  x  0.06338 
Q'  «  0.000968  , 


=  0.0001109  , 


P*TI  =  °*  9998891  * 


SUBSYSTEM  3 


Subsystem  3  is  the  sane  as  subsystem  2  except  that  S  is  replaced  by  T. 
A  similar  computation  yields 


“ill  -  2040 


RjH  *  0.9999804  . 


saggagg 


■  k  rkFVl  X  - 


'  ■VITVS-VX-'V".  -  f>  T  <  '  J-'i  T->i -w-« W  JC-  *1  SSS31 3*^®* " 
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SUBSYSTEM  4 


For  the  parallel  pair: 


n51  *  192 


ps,-° 


Hy,  •  143.25 
"14 

■  0 


FN 


14 


By  the  method  of  section  2.3,  we  obtain 

Equivalent  N  »  27838  , 

R  »  1  . 

Subsystem  4  is  just  this  parallel  pair  repeated  twice  in  series, 
approximation  in  equation  (22),  we  have 

27, 838 
2 


By  the 


N 


IV 


13,919  , 


RIV  -  1 


SUBSYSTEM  5 


Subsystem  5  is  just  a  single  component  repeated  in  parallel: 


H.  »  192 


F,  «  0 

®2 


By  the  method  of  section  2.5  in  the  special  case  where  R  »  1,  we  have 


932 


u:b.  =  0.98799 
°2 

=  1  t 


LCBV  =  1  -  (0.01201)2  =  0.999856  , 

„  In  0. 1 0 

"v  “  ’5,989  , 

Ry  =1  . 


SUBSYSTEM  6 


For  the  upper  series: 


NS2  =  83*33  nv13  =  47.875  N 


S  -  ,85-5 


FS2  a  5  FV13  =  °* 1 24625  F, 


a  =  1.6565 
“21 


Equivalent  m  =  47.875 
R  =  0.92918 


For  the  lower  series: 


Nt  =  109.33 
l2 

FT  =  0.3333 
l2 


Nv  =  47.875 
V14 

Fv  =  0.124625 
V1 4 


No  »  185.5 
°2  2 

Fq  =  1.6565 
°22 


Equivalent  N  =  47.875 
R  =  0.98548 


Q  -  0.0010285  , 

Q*  -  0.0031159  , 

1  -  O' 

Nyj  *  *  478  t 

Q'  -  Q 

RVI  «  0.9989715  . 


SUBSYSTEM  7 


First  the  series  repetition  of  V1  is  reduced: 


-  95.75 

V1 5 

F„  -  0.24925 
V15 

Rv  *  0.997397 
v15 


By  the  approximation  in  equation  (24)  in  the  body  of  the  report,  we  have 


Equivalent  N 


15 


1  -  R. 


47.9  , 


15 


F  «  0.24925 


For  the  upper  series: 


S3 

i 

T, 

V15 

r 

3 

V21 

47.9 
0.24925 


N, 


83.33 


-  5 


S 

ft 

a3 


109.33 

0.3333 


H 

F 


Equivalent  H  ■  47.9 

R  *  0.93226 
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For  th*  lower  series: 


N„  «  18.4  N  =  80.2 

^3  °1 3 

F-  -  0  .  F„  0.0802 

Z3  °13 


Equivalent  N  =  18.4 
R  =  0.999 

For  subsystem  7  (using  the  method  of  sect.  2.3),  we  obtain 

Q  0.00006774  , 

Q*  -  0.0045571  , 

NyU  c  222  , 

Ryjj  «  0.9999323  . 


SUBSYSTEM  8 


LCBZ^  *  0.88230  by  interpolation, 
RVIII  *  1  ' 

bCByjn  «  1  -  ^1  -  LCBZ^^2  «  0.98615, 
Myjjj  *  165  , 

Ryjll  -  1  . 


SUBSYSTEM  9 


For  the  upper  eeriest 


N  -  80.2 
“14 

-  0.0802 
®14 

R_  -  0.999 
®14 


By  tlie  approximation  in  equation  (24)  in  the  body  of  the  report,  we  have 


Equivalent  H 


40.1  , 


F  -  0.0802  . 


For  subsystem  9  (using  the  method  of  sect.  2.3),  we  combine  the  upper  series 
in  parallel  with  Z$,  which  has  Nz^  *  18.4  and  F^  «  0,  and  obtain 

Q  -  0  , 

Q*  -  0.0013548  , 

Njx  ■  737  » 
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SUBSYSTEM  10 


N„  =  80.2 
°15 


Frt  =  0.0802 
a15 


By  the  method  of  section  2e5  we  have 


R„  =  0.999 
“15 


MBo  =  0*97011  by  interpolation 

?  ~  =  0.999999  , 


LCBX  = 


1  -  ^1  -  LCB^^^  „  0.999107  . 

,01^  °f  8ection  2*4  is  ^  use*  to  get  equivalent  test  data, 


Iteration 

N1 

F1 

Bgo(Ni rF1 ) 

t 

N2 

1 

2577 

0.002577 

0.999104 

1.003 

2586 

These  results  lead  to  the  following  data  for  subsystem  10: 

Nx  =  2536  , 


as 


Rx  =  0.999999 


■a  *  V*^r«  lr-wv-x.-* 
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B -SPLINES  ON  NONUNIFORM  TRIANGULATIONS 


Charles  K.  Chui 

Center  for  Approximation  Theory 
Department  of  Mathematics 
Texas  A&K  University 
College  Station,  Texas  77843 


ABSTRACT.  C1  B-splines  of  lowest  total  degrees  on  triangulations  of 
nonuniform  xectangular  partitions  are  considered.  Several  interesting 
properties  are  discovered  for  the  type-one  setting.  In  particular,  there 
are  B-splines,  that  is  splines  with  minimum  supports,  whose  supports  are 
concave,  and  B-splines  do  not  necessarily  form  a  partition  of  unity. 


i ♦  INTRODUCTION.  Consider  a  rectangular  region  D  *  (a,b]  *  [c,d] 
and  let  the  lines  x  -  xi  **  0  and  y  -  y^  =*  0,  where  a  **  Xg  <  ...  < 

=  b  and  c  *  yQ  <  ...  <  yn+1  *  d,  partition  D  into  (m+lXn+l) 

rectangular  cells  D^.  By  drawing  in  all  the  upward  sloping  diagonals  of 

these  rectangles,  we  obtain  a  unidiagonal  (or  type-I)  triangulation  of  D, 
and  by  drawing  in  both  diagonals  to  each  D^  we  have  a  crisscross  (or 

type-2)  triangulation.  Of  course,  in  the  special  case  when  x^+J  -  x^  * 

Xi  "  xi-l  and  yj+l  ~  yj  “  yj  “  yj-l  f0C  1  "  1 . .  and  -  *  1 . . 

these  triangulations  of  D  become  3-directional  and  4-directional 
meshes  respectively,  and  many  interesting  results  in  this  special  setting 
have  been  obtained  recently  (cf.  [1],  [2],  [4],  [7],  and  [8],  for 
instance).  This  article  is  a  continuation  of  [3]  where  the  results  on 
nonuniform  crisscross  triangulation  obtained  in  [6]  were  reported.  We 
take  this  opportunity  to  report  two  misprints  in  [3],  namely:  on  page 
879,  B,  should  be  (y,  -  y.  ,)/(y.,+1  '  yi  ,)  and  the  identity  on  page 
881  should  read  3  3  J  3 


n+1 

i 

j=~i 


m+1 

l 

i=*-l 


(-l)1+j(x 


i+1 


"  xi)(yj+l 


"  yj)Bij(x»y)  -  0. 


Of  course  in  the  equally  spaced  setting,  i.e.  on  a  4-directional  mesh, 
this  linear  dependence  relationship  becomes 


n+1 

l 

j— 1 


i  (-l)i+jB  (x,y)  -  0 
i— 1  J 


and  this  special  case  has  recently  been  generalized  by  Dahmen  and 
Micchelli  [8]  to  a  more  general  regular  grid  partition. 


939 


PREVIOUS  PAGE 
IS  BLANK 


^.•T— vr— «r«  a'^rww.i  7^.g 


Kra*  rvn  v  x  v  viiv.’  v-w*  -i  -» 


One  very  nice  property  of  the  B-splines  B^  on  crisscross 

triangulations  reported  in  [3]  is  that  they  are  very  flexible,  in  the 
sense  that  their  graphs  are  continuous  with  respect  to  the  lines  x  -  * 

0  and  y  -  y,  =  0  as  shown  in  the  pictures  on  pages  880  and  881  in  13]. 

**  1 

This  property  is  not  enjoyed  by  the  C  cubic  splines  on  unidiagonal 
triangulations  as  observed  in  [5].  More  results  have  now  been  found  and 


it  is  the  purpose  of  this  article  to  report  some  of  the  unusual  properties 


these  smooth  cubic  splines  on  unidiagonal  triangulations. 


Fig.  la  Fig  lb 


2.  SMOOTH  CUBIC  B-SPLINES.  In  this  section  we  only  consider 
unldlagonal  tricngulation  for  the  rectangle  D  as  described  above.  Inthe 
equally  spaced  setting,  i.e.  on  a  3-directional  mesh,  there  are  two  C1 
cubic  B-splines  where  one  is  a  180°-rotation  of  the  ether.  The  support 
of  one  of  them  is  given  in  Fig.  la.  In  general,  the  grid  configuration  of 
this  support  would  look  like  the  one  shown  in  Fig.  lb.  Let  us  call  this 
grid  configuration  A^ .  It  was  observed  in  [5]  that  A^  is  the 

support  of  a  nontrivial  bivariate  Cl  cubic  spline  if  and  only  if 


940 


m 


jrV 


progressions.  Suppose 


1  LTV  tTl  I'Um.Vl’VlAii: 


It  [6]  the  "unique"  cubic  B-spline  supported  by  A^  was  constructed  as 
shown  in  Fig.  2.  We  remark  here  that  A^  may  happen  to  be  concave.  A 

simple  example  is  given  by  setting  p  =  -j  and  q  =  1  as  shown  in  Fig.  3a 

and  a  picture  of  the  3-spl  ine  it  supports  is  given  ir.  Fig  3b.  Suppose  for 
the  time  being  that  both  the  horizontal  and  vertical  spacings  of  the 
entire  original  rectangular  grid  partition  of  D  are  geometric 
progressions.  It  is  perhaps  surprising  to  note  that  these  B-splines  do 
not  necessarily  produce  constants.  More  precisely,  if  one  of  the 
geometric  ratios  p  and  q  is  different  from  one,  then  the  constant  1  is 
not  a  linear  combination  of  these  R-solines. 

If  the  lines  x  -  x.  =0  and  y  -  y,  =■  0  are  arbitrarily  given, 

1  ^ 

then  the  supports  of  the  bivariate  C  cubic  B-splines,  if  they  exist, 

have  to  increase.  In  fact  the  sizes  and  shapes  of  the  B-splines  depend 

very  heavily  on  the  sequences  {x  -  x  }  and  {y  -  y  }.  These  and 

1+1  I  j+1  j 

other  related  results  will  be  discussed  in  a  forthcoming  joint  paper  with 
my  student  G.  Chen. 
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A  MODEL  FOR  ASYNCHRONOUS  DISTRIBUTED  COMPU f AT  I CM* 


Dimitri  F*.  Bertsekas 

Laboratory  for  Information  and  Decision  Systems 
Department  of  Electrical  Engineering  and  Decision  Systems 
Massachusetts  Institute  of  Technology 
Cambridge,  Massachusetts  02139 

ABSTRACT.  We  present  an  algorithmic  model  for  distributed 
co.tiput  at-  ion  of  fixed  points  whereby  sewer  a  1  processors 
pa  r  tici  pa  t  e  si  mu  1 1  aneous  1  y  i  n  t  he  cal  cu  1  at  i  ons  wh  i  1  e  exch  .<ng  l  ng 
information  via  coriimunicat  ion  links.  We  place  essentially  no 
assumptions  on  the  ordering  of  computation  and  communication 
between  processors  thereby  allowing  for  completely  uncoordinated 
execution.  We  find  that  even  under  these  potentially  chaotic 
circumstances  it  is  possible  to  solve  several  important  classes 
of  problems  including  the  calculation  of  fixed  points  of 
contraction  and  monotone  mappings  arising  in  linear  and  nonlinear 
systems  of  equations,  shortest  path  problems,  and  dynamic 
pr ogramming. 

1.  I HTRQDUCT I OH .  There  is  presently  a  great  deal  ot 
interest  in  distributed  implementations  of  various  iterative 
algorithms  whereby  the  computational  load  is  shared  by  several 
pr  ocessors  wh i  1  e  coord i  nat  i  on  i  s  ma  i nt.  a  i  ned  by  i  n  format  i  on 
exchange  via  communication  links.  In  most  of  the  work  done  in 
i  his  area  the  starting  point  is  some  iterative  algorithm  which  is 
guaranteed  to  converge  to  the  correct  solution  under  the  usual 
oi r cumstances  of  centralized  computation  in  a  single  processor. 
The  computational  load  of  the  typical  iteration  is  then  divided 
in  some  way  between  the  available  processors ,  and  it  is  assumed 
th.it  the  processors  exchange  all  necessary  informat  ion  regarding 
the  outcomes  of  the  current  iteration  before  a  new  iteration  can 
beg i n . 


The  mode  of  operation  described  - bove  may  be  termed 
synchronous  in  the  sense  that  each  p>  -icesso r  must  complete  its 
assigned  portion  of  an  iteration  and  communicate  the  results  to 
every  other  processor  before  a  new  iteration  can  begin.  This 
assumption  certainly  enhances  the  orderly  operation  of  the 
algorithm  and  greatly  simplifies  the  convergence  anayris.  On  the 
other  hand  synchronous  distributed  algorithms  also  have  some 
obvious  disadvantages  such  as  the  need  for  an  algorithm 
initiation  and  iteration  synehronizat ion  protocol.  Fu,  thermore 
♦he  speed  of  computation  is  limited  to  that  of  the  slowest 
processor.  It  is  thus  interesting  to  consider  algorithms  that 
can  tolerate- a  more  flexible  ordering  of  computation  and 
communicat ion  between  processors.  Such  algorithms  have  so  far 
found  applications  in  computer-  communication  networks  like  the 
hRPhHET  C1D  where  processor  failures  are  common  and  it  is  quite 
cotiipl  ioated  to  maintain  synehronizat  ion  between  the  nodes  of  the 
entire  network  as  they  execute  real-time  twork  functions  such 
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as  the  routing  algorithm 


Processor  network  environments  for  which  weakly  coordinated 
d i st  r i but ed  cornput  at i on  seems  part i cu 1 ar 1 y  advant  ageous  t  yp i ea 1 1 y 
possess  one  or  more  of  the  following  charaeterist ics  all  of  which 
involve  occur ance  of  some  type  of  unpredictable  event, 

1 >  Computation  nodes  and  communication  links  are  subject  to 
frequent  and/or  unexpected  failures.  CFor  example  packet  radio 
networks. > 

2>  Computation  nodes  have  different  and/or  time  varying  speeds 
of  execution.  <For  example  each  processor  is  assigned  to  a 
perhaps  time  varying  number  of  tasks  involving  computation  loads 
which  arc  not  fixed  a  priori.> 

35  Computation  at  various  nodes  is  event  driven.  cp nr  example 
in  data  collection  or  sensor  networks  where  the  timing  and 
ordering  uf  measurements  may  not  be  predictable. > 

It  is  possible  to  consider  various  degrees  of  coordination 
in  different  types  of  distributed  algorithms.  An  interesting 
question  is  to  determine  the  minimum  degree  of  coordination 
needed  in  a  given  algorithm  in  order  to  obtain  the  correct 
solution.  To  this  end  we  consider  an  extreme  model  of 
uncoordinated  distributed  algorithms  whereby  computation  and 
communication  are  performed  at  each  processor  completely 
independent ly  of  the  progress  in  other  processors.  It  is  perhaps 
surprising  that  even  under  these  chaotic  circumstances  it  is 
still  possible  to  solve  correctly  important  classes  of  fixed 
point  problems.  The  complete  analysis  is  given  in  C23  for  broad 
classes  of  dynamic  programming  and  in  E3D  for  more  general  fixed 
point  problems  involving  contraction  and  monotonicity 
assumptions.  Further  related  work  is  C53  and  L'63. 

2 .  A  Model  for  Distributed  Uncoordinated  Fixed  Point  Algorithms 

The  fixed  point  problem  considered  in  this  paper  is  defined 
in  terms  of  a  set  X?  a  class  F  of  functions  mapping  X  ir.to  the 
extended  real  line  [-«■,+<»  3,  and  a  mapping  T  which  maps  F  into 
itself.  We  wish  to  find  an  element  J*  of  F  such  that 


j*  =  T<J*> 


<1> 


or  equivalently 


<2> 


T<J*Xx>» 


y  xeX, 
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where  J  +■  <. x }  and  T >  denote  the  values  of  the  functions  J+ 
and  I  respectively  at  the  typical  element  xsX.  Ue  will 

assur  throughout  that  T  has  a  unique  fixed  point  within  the 
set  F. 

We  provide  some  examples: 

Exarnp  1  e  1 :  <Fixed  points  of  mappings  on  Rn>.  Let  X  be  the 

finite  set 


X  —  {  1  i  2)  t  •  •  >  Illy 


and  F  be  the  set  of  all  real -valued  functions  on  X. 

Then  F  can  be  identified  with  the  n-dimensional  space  Fsn  in  the 
sense  that  with  each  JsF  we  can  associate  the  n-dimensional 
vector  J  ■.?}»•>..,  J<n>5.  Similarly  T<J>  can  be  identified 

with  the  n-dimensional  vector  . . .  so  the  fi.-ed 

point  problem  <1>  amounts  to  solving  the  system  of  n  equations 


J*  =  T<J*}  or  J*< i >  =  T<J*><i>,  y  i  =  <3> 

with  the  n  unknowns  J*».l}» . . . >  J*<n>.  It  is  also  evident  that  any 
system  of  n  < possibly  non I i near >  equations  with  n  unknowns  can  be 
formulated  into  a  fixed  point  problem  such  as  <3>. 

Example  2:  < Shortest  path  problems}.  Let  <H, L>  be  a  directed 

graph  where  H  =  f 1 > 2, . . . » n>  denotes  the  set  of  nodes  and  L 
denotes  the  set  of  links.  Let  N<i>  denote  the  downstream 
neighbors  of  node  i»  i.e.»  the  set  of  nodes  j  for  which  <i,j>  is 
a  link.  Assume  that  each  link  <i,j>  is  assigned  a  positive 
scalar  a^,  j  referred  to  as  its  length.  Assume  also  that  there  is 
a  directed  path  to  node  1  from  every  other  node.  Then  it  is 
known  <[4]*  p.  67 >  that  the  shortest  path  distances  J=T:<i>  to  node 
1  from  all  other  nodes  i  solve  uniquely  the  equations 
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3*<i? 


1 


<4*> 


=  min  <a<  *  +  3*<  j>>?  i 
jsN<i> 


J*<1>  = 


<4b> 


If  we  make  the  identifications  X  =  -C  1 » 2> . . .  » n>»  F*  Set  of  all 
functions  mapping  X  into  <©*+«*3»  and  define  T<J>  for  ail  JsF  by 
means  of 

min  C<Li  3  +  J< j>>  if  i  *  1 

4«N<i>  J 

8  if  i  =  1  <3> 


then  we  find  that  the  fixed  point  problem  (2>  reduces  to  the 
shortest  path  problem. 

The  shortest  path  problem  above  is  representat ive  of  a  broad 
class  of  dynamic  programming  problems  which  can  be  viewed  as 
special  cases  o?  the  fixed  point  problem  <2>  and  can  be  correctl 
solved  by  using  the  distributed  algorithms  of  this  paper  <see 
[:?•]>. 

Our  algorithmic  model  can  be  described  in  terms  of  a 
collection  of  n  computation  centers  Cor  processors?  referred  to 
as  nodes  and  denoted  1?1?.’.  .?n.  The  set  X  is  partitioned  into  n 
disjoint  sots  denoted  Xj»...»Xn,  i.e. 


n 

X  =  u  X, ,  X,  n  X,  =  8,  if  i  j . 

i  =  l 


Each  node  i  is  assigned  the  response  j  1  ity  of  comput  i ng  the 
values  of  the  solution  function  J*  Cc. f .  <!>»  <2>3  at  all  xsX^. 

At  each. time  instant?  node  i  can  in  be  one  of  three  possible 
states  compute,  transmit .  or  idle.  In  the  compute  state  node  i 
computes  a  new  estimate  of  the  values  of  the  solution  function  J* 
for  all  xsX^.  In  the  transmit  state  node  i  communicates  the 
estimate  obtained  from  the  latest  computation  to  one  or  more 
nod»s  j  Cjs^i?.  in  the-  idle  sinie  node  i  does  nothing  related  to 
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nr  vr-JT' 


t-he  solution  of  the  problem.  It  is  assumed  that  a  node  can 
receive  a  transmission  from  other  nodes  simultaneously  with 
comput.  ing  or  transmit*  ing.  We  assume  that  computation  and 
transmission  for  each  node  takes  place  in  uninterrupted  time 
intervals  Ct.jt^^  with  tj  <  t2,  but  do  not  exclude  the 
possibility  that  a  node  may  be  simultaneously  transmitting  to 
more  than  one  nodes  nor  do  we  assume  that  the  transmission 
intervals  to  these  nodes  have  the  same  origin  and/or  termination. 
Me  also  make  no  assumptions  on  the  iengtn,  timing  and  sequencing 
of  computation  and  transmission  intervals  other  than  the 
foil ow i ng : 


ftsauinpl  1  on  <  A  > :  There  exists  a  positive  scalar  P  such  that?  for 
eve»*y  node  i,  every  time  interval  of  length  P  contains  at  least 
one  computation  interval  for  j  and  at  least  one  transmission 
interval  from  i  to  each  node  j  ^  i. 

Each  node  i  also  has  a  buffer  Ehj  for  each  .j  =*  j  where  it 
stores  the  latest  transmission  from  3i>  as  well  as  a  buffer  B^j 
where  it.  stores  its  own  estimate  of  values  of  the  solution 
function  for  all  The  contents  of  each  buffer  Bj  j  at  time  t 

are  denoted  Oftt j «  Thus  jlj  is,  for  every  t»  a  function  from  >»< 
into  and  may  be  viewed  as  the  estimate  by  node  i  of  the 

restriction  of  the  solution  function  J*  on  Xj  available  at  time 
t.  The  rules  according  to  which  the  functions  7**^  are  updated 
are  as  follows! 


*>  If  Ctj,t2^  is  a  transmission  i  it-erval  from  node  j  to  node  i 
the  contents  Jjj  of  the  buffer  B^  at  time  tj  are  transmitted  and 
entered  in  the  buffer  Bjj  at  time  t2»  i.e. 


<6> 


2>  If  Ctj,t23  is  a  computation  interval  for  node  i  the  contents 
of  buffer  B^j  at  time  t2  are  replaced  by. the  restriction  of  the 
function  TCJpon  Xj  where,  for  all  t,  Jps  defined  by 


J*;'<x> 

1 


S3 


<x> 

<x> 


if  xsXA 

if  xsX^,  j  j*  i. 


In  other  words  we  have 


<7> 


3>  The  contents  of  &  buffer  Bjj  can  change  only  at  the  end  c>»  a 
computation  interval  for  node  i,  The  contents  of  a  buffer  ;•  • 
i  can  change  only  at  the  end  of  a  transmission  interval  from  j 
to  i« 


Additional  conditions  under  which  there  holds 


lim  Jj<x>  >=  J*<x>,  y  xcX.,  i  »  l,...,n 

ta.»  1  1 


may  be  found  in  C23,  C31.  An  interesting  aspect  of  results  of 
this  type  is  that  they  do  not  require  that  the  initial  processoi 
buffer  contents  be  identical  and  indeed  these  initial  conditions 
can  vary  within  a  broad  range.  This  means  that  for  problems  that 
are  being  solved  continuously  in  real  time  it  is  not  necessary  to 
reset  the  initial  conditions  and  resynchronize  the  algorithm  each 
time  the  problem  data  changes.  As  a  result  the  potential  for 
tracking  slow  variations  on  the  solution  function  is  improved  and 
algorithmic  implementation  is  considerably  simplified. 


*This  research  was  conducted  at  the  M.I.T.  Laboratory  for 
Information  and  Decision  Systems  with  partial  support  provided  by 
the  Defense  Advanced  Projects  Agency  under  Contract  Ho.  OHR- 
H09014-75-C-1  183  and  Army  Research  Office  under  Contract  Ho. 
DAAG29-84-K-0005. 
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